Recognition Patents (Class 704/231)

Neural network (Class 704/232)

Detect speech in noise (Class 704/233)

Normalizing (Class 704/234)

Speech to image (Class 704/235)

Specialized equations or comparisons (Class 704/236)

Creating patterns for matching (Class 704/243)

Voice recognition (Class 704/246)

Word recognition (Class 704/251)

Audible command modification

Patent number: 11210059

Abstract: A method and system for modifying an audible command is provided. The method includes continuously receiving audible commands associated with a context of interactions between a user and individuals. The audible commands are analyzed with respect to associated actions and user attributes of the audible commands are identified. Specified information required for executing each command of the audible commands and portions of the specified information associated with specified individuals of the individuals are determined. Digital audio samples of the user are retrieved and assigned to the portions of the specified information with respect to each command. The associated actions are modified with respect to the specified individuals and self-learning software code comprising the modified actions is generated and executed such that the commands are executed with respect to the modified actions.

Type: Grant

Filed: June 25, 2019

Date of Patent: December 28, 2021

Assignee: International Business Machines Corporation

Inventors: Craig M. Trim, Garfield W. Vaughn, Shubhadip Ray, Sarbajit K. Rakshit
Evidence aggregation across heterogeneous links for intelligence gathering using a question answering system

Patent number: 11204929

Abstract: Mechanisms, in a Question Answering (QA) system comprising a processor and a memory, for evaluating a hypothetical link in an ontology are provided. An initial analysis of the ontology is performed to identify a set of information concept entities and links between information concept entities in the ontology. The hypothetical link between a first information concept entity and a second information concept entity in the ontology is generated based on the initial analysis of the ontology. Natural language questions corresponding to the hypothetical link are processed to generate answer results directed to a plurality of links between a plurality of information concept entities. The answer results are aggregated across the plurality of links to determine an aggregate answer result for the hypothetical link. An indication of whether or not the hypothetical link is a valid link is output based on the aggregate answer result for the hypothetical link.

Type: Grant

Filed: November 18, 2014

Date of Patent: December 21, 2021

Assignee: International Business Machines Corporation

Inventors: Darryl M. Adderly, Corville O. Allen, Robert K. Tucker
Voice profile updating

Patent number: 11200884

Abstract: Techniques for labeling user inputs for updating user recognition voice profiles are described. A system may leverage various signals, generated during or after processing of a user input, to retroactively determine which user spoke the user input. For example, after the system receives the user input, the user may provide the system with non-spoken user verification information. Based on such user verification information, the system may label the previously spoken user input as originating from the particular user. The system may also or alternatively use system usage history to retroactively label user inputs.

Type: Grant

Filed: November 6, 2018

Date of Patent: December 14, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Sundararajan Srinivasan, Arindam Mandal, Krishna Subramanian, Spyridon Matsoukas, Aparna Khare, Rohit Prasad
System and method for intelligent language switching in automated text-to-speech systems

Patent number: 11195510

Abstract: Systems, methods, and computer-readable storage media for providing for intelligent switching of languages and/or pronunciations in a text-to-speech system. As the system receives text, the text is analyzed to identify portions which should have speech constructed using a pronunciation distinct from the remaining portions of the text. The text-to-speech system uses multiple pronunciation dictionaries to generate and produce speech corresponding to the text, where the identified portions of the text are in a different language or have a different accent from the remainder of the text. Having generated speech corresponding to the text in multiple languages, accents, or dialects, the system combines the portions, then communicates the speech to the text recipient.

Type: Grant

Filed: August 16, 2019

Date of Patent: December 7, 2021

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Gregory Pulz, Harry E. Blanchard, Lan Zhang
System enabling audio-based navigation and presentation of a website

Patent number: 11188199

Abstract: A website navigation system has an analysis system which receives a request for an Internet web page from a client device. The analysis system receives web page data associated with the web page from the Internet and performs a data analysis process to organize the web page data for use in a virtual conversation with the user in order to present the web page in an audible format. The analysis system identifies separate elements of the web page from the web page data and extracts information from the separate elements based on the web page data. The analysis system groups the separate elements into categories based on the extracted information and sorts the groups of separate elements based on usage statistics. The analysis system then generates a prompt for being output to the user by the client device as audible output based on the sorted groups of separate elements.

Type: Grant

Filed: April 16, 2018

Date of Patent: November 30, 2021

Assignee: International Business Machines Corporation

Inventors: Florian Pinel, Donna K. Byron, Christian Ewen, Carmine Dimascio, Benjamin L. Johnson
Electronic device, control method thereof, and sound output control system of the electronic device

Patent number: 11188290

Abstract: Provided are an electronic device, a control method thereof, and a sound output control system of the electronic device, for example, a technique for controlling sound that is output from an electronic device located in the same space as a voice recognition device.

Type: Grant

Filed: October 17, 2019

Date of Patent: November 30, 2021

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jung Su Kim, Ho Jin Eo
Communication device

Patent number: 11190632

Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.

Type: Grant

Filed: October 8, 2020

Date of Patent: November 30, 2021

Inventor: Iwao Fujisaki
Vehicle and control method thereof

Patent number: 11189276

Abstract: A vehicle includes a communication device configured to communicate with a terminal capable of providing a communication function; a sensor configured to receive voice of a user; a storage configured to store a user pattern related to a call pattern of the user; and a controller configured to search for at least one name candidate corresponding to input voice when receiving the input voice, determine a threshold for a confidence score of the at least one name candidate based on the user pattern, and select a name corresponding to the input voice from among the at least one name candidate based on the determined threshold.

Type: Grant

Filed: February 1, 2019

Date of Patent: November 30, 2021

Assignees: Hyundai Motor Company, Kia Motors Corporation

Inventor: Kyung Chul Lee
Communication device

Patent number: 11184469

Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.

Type: Grant

Filed: October 8, 2020

Date of Patent: November 23, 2021

Inventor: Iwao Fujisaki
Communication device

Patent number: 11184468

Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.

Type: Grant

Filed: October 8, 2020

Date of Patent: November 23, 2021

Inventor: Iwao Fujisaki
Communication device

Patent number: 11184470

Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.

Type: Grant

Filed: October 8, 2020

Date of Patent: November 23, 2021

Inventor: Iwao Fujisaki
System and method for interpreting data transfer from a voice recognition front end

Patent number: 11176941

Abstract: A method and system for interpreting data transfer from a voice recognition platform. The voice recognition platform data transfer may include a designator, device identification, a command or query, and a plurality of variables. The voice recognition string from the voice recognition platform may include the designator, the query, and at least one variable of the plurality of variables. Further, the method may include generating an instruction string from the recognition string. The instruction string may include the designator, the query and the at least one variable of the plurality of values. The method may include removing extraneous symbols from the instruction string to generate a cleaned instruction string comprising the designator, a cleaned query, and a cleaned variable of the at least one variable of the plurality of variables. The method may include searching a context platform database for data corresponding to the cleaned query.

Type: Grant

Filed: October 28, 2019

Date of Patent: November 16, 2021

Assignee: Connected Living Technology, LLC

Inventors: Sarah Hoit, Brian McWade, Josiah Strandberg
Translation of verbal directions into a list of maneuvers

Patent number: 11175154

Abstract: Natural language directions are received and a set of maneuver/context pairs are generated based upon the natural language directions. The set of maneuver/context pairs are provided to a routing engine to obtain route information based upon the set of maneuver/context pairs. The route information is provided to an output system for surfacing to a user.

Type: Grant

Filed: November 20, 2018

Date of Patent: November 16, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Elizabeth P. Salowitz, David Grochocki, Jr., Jeff West
Contextual multi-channel speech to text

Patent number: 11170765

Abstract: A method for improving a transcription may include identifying, in the transcription, reliable channel tokens of an utterance of a reliable channel and an unreliable channel token of an utterance of an unreliable channel, and generating, using a machine learning model, a vector embedding for the unreliable channel token and vector embeddings for the reliable channel tokens. The method may further include calculating vector distances between the vector embedding and the vector embeddings, and generating, for the unreliable channel token and using the vector distances, a score corresponding to a reliable channel token. The method may further include determining that the score is within a threshold score, and in response to determining that the score is within the threshold score, replacing, in the transcription, the unreliable channel token with the reliable channel token.

Type: Grant

Filed: January 24, 2020

Date of Patent: November 9, 2021

Assignee: Intuit Inc.

Inventors: Oren Sar Shalom, Yair Horesh, Alexander Zhicharevich, Elik Sror, Adi Shalev, Yehezkel Shraga Resheff
Information processing apparatus, information processing system, and information processing method

Patent number: 11172082

Abstract: An information processing apparatus includes circuitry that receives, via a communication network, a first user request input in voice to a terminal, and reflects the first user request in a type or setting of a job. When a second user request input in voice to the terminal after the first user request is received via the communication network during the reflection of the first user request, the circuitry displays, on a display, information of the type or setting of the job reflecting a previous user request preceding the second user request. The previous user request includes the first user request.

Type: Grant

Filed: September 26, 2019

Date of Patent: November 9, 2021

Assignee: RICOH COMPANY, LTD.

Inventor: Shun Yoshimi
Contact management

Patent number: 11163905

Abstract: A system for data sharing between a first user and a second user includes a memory and a processor. The processor is configured to execute instructions stored in the memory to associate a unique identifier with a first profile of the first user, the first profile includes user data; obtain, from a second device of the second user, a sensed identifier; and, in response to the sensed identifier matching the unique identifier of the first user, execute instruction to send, to the first user, a first request to share first user data of the first user with the second user; and receive, from the first user, a response to the first request to share the first user data of the first user with the second user. The sensed identifier is captured by a sensor of the second device.

Type: Grant

Filed: December 30, 2019

Date of Patent: November 2, 2021

Assignee: Ginko LLC

Inventors: Ronald J. Czajka, Sam B. Attisha
Medical report coding with acronym/abbreviation disambiguation

Patent number: 11152084

Abstract: Techniques for coding a medical report include identifying an acronym or abbreviation in the medical report, and a plurality of phrases not explicitly included in the medical report that are possible expanded forms of the acronym or abbreviation in the medical report. From the plurality of phrases, a most likely expanded form of the acronym or abbreviation may be selected by applying to the medical report a statistical acronym/abbreviation expansion model trained on a corpus of medical reports. By applying to the medical report with the expanded acronym or abbreviation one or more statistical fact extraction models, a clinical fact may be extracted from the medical report based at least in part on the most likely expanded form of the acronym or abbreviation in the medical report, and a corresponding medical taxonomy code may be assigned to the extracted clinical fact from the medical report.

Type: Grant

Filed: February 16, 2016

Date of Patent: October 19, 2021

Assignee: Nuance Communications, Inc.

Inventors: Ravi Kondadadi, Girija Yegnanarayanan, Brian William Delaney, John Ortega
Routing natural language commands to the appropriate applications

Patent number: 11152009

Abstract: In a voice controlled system, multiple applications are configured to respond to various commands. The voice controlled system includes client devices and servers. The correct application to receive a natural language command is identified based on how well the command matches functions of the application. A target application to receive the command may additionally be selected based on which application is most likely to receive a command. Likelihood of an application receiving a command may be determined by considering context. The command may be a voice input to a client device that is analyzed by speech recognition technology to determine word strings representing possible commands. Thus, the selection of a target application to receive the command may be based on word strings from the natural language input, a closeness of fit between the command and an application, and/or the likelihood an application is the target for the next incoming command.

Type: Grant

Filed: August 14, 2017

Date of Patent: October 19, 2021

Assignee: Amazon Technologies, Inc.

Inventor: Jeffrey Penrod Adams
Automated document analysis comprising company name recognition

Patent number: 11138377

Abstract: At least two processing device-implemented company name recognition components, operating upon a body of text in a document, identify at least one company name occurrence in the body of text based at least in part on a company identifier list. The company name recognition techniques implemented by each of the at least two company name recognition components are different from each other. The at least one company name occurrence is used to update the company identifier list. The updated company identifier list is then used by the at least two company name recognition components to identify at least one additional name occurrence in the same body of text. This process of repeatedly identifying occurrences of company names in the body of text and updating the company identifier list is performed until such time that no further company name occurrences are identified in the body of text.

Type: Grant

Filed: December 30, 2019

Date of Patent: October 5, 2021

Assignee: Freedin Solutions Group, LLC

Inventors: David A. Cook, Andrzej H. Jachowicz, Phillip Karl Jones
Automatic media device input scrolling

Patent number: 11138976

Abstract: Devices, systems, and methods are provided for automatic media device input scrolling. The system may receive voice data associated with a first device. The system may determine, based on the voice data, an input of the first device. The system may determine an active input of the first device. The system may determine a number of inputs from the active input to the input. The system may send one or more instructions based on the number of inputs.

Type: Grant

Filed: September 30, 2019

Date of Patent: October 5, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Samuel Thomas Bailey, Bernardo De Carvalho e Silva, Mirosla Nadj, Damjan Majstorovic
Locally distributed keyword detection

Patent number: 11138975

Abstract: In one aspect, a playback device includes at least one microphone configured to detect a voice input and generate sound input data. The playback device detects a first command keyword in the detected sound and, in response, makes a first determination, via a first local natural language unit (NLU), whether the input sound data includes at least one keyword within a first predetermined library of keywords. The playback device receives an indication of a second determination made by a second NLU that the input sound data includes at least one keyword from a second predetermined library of keywords. The playback device compares the results of the first determination and the second determination and, based on the comparison, foregoes further processing of the input sound data.

Type: Grant

Filed: July 31, 2019

Date of Patent: October 5, 2021

Assignee: Sonos, Inc.

Inventors: Nick D'Amato, Connor Kristopher Smith
Robust expandable dialogue system

Patent number: 11132499

Abstract: An automated natural dialogue system provides a combination of structure and flexibility to allow for ease of annotation of dialogues as well as learning and expanding the capabilities of the dialogue system based on natural language interactions.

Type: Grant

Filed: August 28, 2018

Date of Patent: September 28, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Percy Shuo Liang, David Leo Wright Hall, Jesse Daniel Eskes Rusak, Daniel Klein
On-device custom wake word detection

Patent number: 11132992

Abstract: Generally discussed herein are devices, systems, and methods for on-device detection of a wake word. A device can include a memory including model parameters that define a custom wake word detection model, the wake word detection model including a recurrent neural network transducer (RNNT) and a lookup table (LUT), the LUT indicating a hidden vector to be provided in response to a phoneme of a user-specified wake word, a microphone to capture audio, and processing circuitry to receive the audio from the microphone, determine, using the wake word detection model, whether the audio includes an utterance of the user-specified wake word, and wake up a personal assistant after determining the audio includes the utterance of the user-specified wake word.

Type: Grant

Filed: July 25, 2019

Date of Patent: September 28, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Emilian Stoimenov, Rui Zhao, Kaustubh Prakash Kalgaonkar, Ivaylo Andreanov Enchev, Khuram Shahid, Anthony Phillip Stark, Guoli Ye, Mahadevan Srinivasan, Yifan Gong, Hosam Adel Khalil
Information processing device, information processing method, and non-transitory computer readable storage medium

Patent number: 11132999

Abstract: An information processing device according to the present application includes an extraction unit and a subsequent stage generation unit. The extraction unit extracts a last conversation of a feedback utterance estimated to indicate a predetermined reaction of a second utterance subject relative to an utterance made by a first utterance subject, from a set of a plurality of conversations, based on a score assigned to the feedback utterance. The subsequent stage generation unit generates a subsequent stage classifier for deriving an index indicating a category of an unknown conversation, based on the last conversation extracted by the extraction unit.

Type: Grant

Filed: September 4, 2018

Date of Patent: September 28, 2021

Assignee: YAHOO JAPAN CORPORATION

Inventors: Chikara Hashimoto, Manabu Sassano
Occupancy sensing systems and methods

Patent number: 11125907

Abstract: The present disclosure provides systems and methods for improved occupancy sensing. The methods and systems can deploy various signal threshold adjustments and/or signal analysis algorithms in response to sensed signals having a given quality, such as exceeding a threshold. In some cases, signal thresholds are lowered following an initial generated signal exceeding a first, higher threshold. In some cases, time-dependent signals are monitored using algorithms that analyze the signals for variations that are characteristic of human usage. Methods are disclosed for determining if two motion sensors are observing the same or overlapping spaces. Systems and methods for calibrating motion sensing systems are also disclosed.

Type: Grant

Filed: May 17, 2019

Date of Patent: September 21, 2021

Assignee: Steelcase Inc.

Inventors: Michael Bloem, Marcus Ward, Mychal Hall
Organizational-based language model generation

Patent number: 11120788

Abstract: Provided is a system and method for acquiring training data and building an organizational-based language model based on the training data. In one example, the method may include collecting organizational data that is generated via one or more applications associated with an organization, aggregating the collected organizational data with previously collected organizational data to generate aggregated organizational training data, training an organizational-based language model for speech processing based on the aggregated organizational training data, and storing the trained organizational-based language model.

Type: Grant

Filed: June 27, 2019

Date of Patent: September 14, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Cem Aksoylar, Michael Levit, Xin Meng, Shuangyu Chang, Suyash Choudhury, Dhiresh Rawal, Tao Li, Rishi Girish, Marcus Jager, Ananth Rampura Sheshagiri Rao
Automatic speech recognition

Patent number: 11120793

Abstract: It is depicts a method of speech recognition, sequentially executed by a processor on consecutive speech segments that comprises: obtaining digital information, which is a spectrogram representation, of a speech segment, and extracting from it speech features that characterizes the segment from the spectrogram representation. Then, a consistent structure segment vector based on the speech features is determined onto which machine learning is deployed to determine at least one label of the segment vector. A method of voice recognition and image recognition sequentially executed by a processor, on consecutive voice segments is also described. A system for executing speech, voice, and image recognition is also provided that comprises client devices to obtain and display information, a segment vector generator to determine a consistent structure segment vector based on features, and a machine learning server to determine at least one label of the segment vector.

Type: Grant

Filed: June 11, 2017

Date of Patent: September 14, 2021

Assignee: VoicEncode Ltd.

Inventor: Omry Netzer
Speech recognition with image signal

Patent number: 11114101

Abstract: A method of speech recognition and person identification based thereon, comprising: recording speech from a speech signal using a microphone; illuminating a speaking mouth; recording a degree of light reflected by the mouth from a reflection signal using a sensor; and recording combined parameters of the speech signal and of the reflection signal, and coupling them to letters associated therewith, per predetermined time duration; comparing a combination occurring in speech of parameters of the speech signal and of the reflection signal to the recorded combined parameters of the speech signal and of the reflection signal which are coupled to letters; and deciding on the basis of the comparison to which letter the combination occurring in the speech of parameters of the speech signal and of the reflection signal corresponds, using block-width modulation of the reflection signal.

Type: Grant

Filed: January 25, 2019

Date of Patent: September 7, 2021

Assignee: IEBM B.V.

Inventors: Olaf Petrus Quirinus Mossinkoff, Johannes Leonardus Jozef Meijer
Automatic speech recognition triggering system

Patent number: 11102568

Abstract: An automatic speech recognition (ASR) triggering system, and a method of providing an ASR trigger signal, is described. The ASR triggering system can include a microphone to generate an acoustic signal representing an acoustic vibration and an accelerometer worn in an ear canal of a user to generate a non-acoustic signal representing a bone conduction vibration. A processor of the ASR triggering system can receive an acoustic trigger signal based on the acoustic signal and a non-acoustic trigger signal based on the non-acoustic signal, and combine the trigger signals to gate an ASR trigger signal. For example, the ASR trigger signal may be provided to an ASR server only when the trigger signals are simultaneously asserted. Other embodiments are also described and claimed.

Type: Grant

Filed: April 26, 2019

Date of Patent: August 24, 2021

Assignee: APPLE INC.

Inventors: Sorin V. Dusan, Aram M. Lindahl, Robert D. Watson
Voice assistant device and method thereof

Patent number: 11100935

Abstract: Embodiments of present disclosure relates to a voice assistant device and method for controlling the voice assistant device. The voice assistant device comprising receiver configured to receive at least one voice input from user, when operated in wake-up mode. Intent associated with the at least one voice input from the at least one user. Further, probability of issuance of a subsequent voice input from the at least one user is determined based on at least one of the intent, historic data and one or more contextual factors. An extended wake-up duration of the voice assistant device is estimated, when the probability is greater than a predefined threshold value. Further, duration of the wake-up mode is extended for the extended wake-up duration to receive the subsequent voice input from the at least one user.

Type: Grant

Filed: June 5, 2019

Date of Patent: August 24, 2021

Assignee: Samsung Electronics Co., Ltd.

Inventors: Vijaya Kumar Tukka, Chethan Konanakere Puttanna, Deepraj Prabhakar Patkar, Sulochan Naik, Harish Bishnoi
Device and method for performing functions

Patent number: 11099812

Abstract: Provided is a device including a display, an audio inputter, and a controller. The display displays at least one screen page of an application that is being executed. The audio inputter receives a voice command of a user. The controller performs an operation corresponding to the voice command by using screen page transition information for transition between application screen pages corresponding to the voice command, which is obtained from information about user interface (UI) elements included in the application screen pages of the application. Each of the UI elements performs a predetermined function when selected by the user.

Type: Grant

Filed: August 4, 2020

Date of Patent: August 24, 2021

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Han-min Bang, Hyok-sung Choi
Determining a performance criterion for a wireless device

Patent number: 11096073

Abstract: Techniques are disclosed for determining a performance criterion for a client device. A performance criterion for a client device may be determined based on a rate of mobility of a client device. Additionally or alternatively, a performance criterion for a client device associated with a particular attribute may be determined based on performance levels of a set of client devices associated with the same particular attribute. The performance criterion is used to evaluate a performance level of a client device. If the performance criterion is not satisfied, then a wireless configuration is modified to improve the performance level.

Type: Grant

Filed: February 15, 2016

Date of Patent: August 17, 2021

Assignee: Facebook, Inc.

Inventor: Subbu Ponnuswamy
Speech recognition results re-ranking device, speech recognition results re-ranking method, and program

Patent number: 11087745

Abstract: To provide a speech recognition results re-ranking technology for re-ranking speech recognition results so as to render speech recognition results suitable for intended use of speech recognition while reducing preparation costs required prior to execution of re-ranking processing of speech recognition results. A speech recognition results re-ranking device includes: a speech recognition unit 210 that generates a speech recognition result set with recognition score from speech data; and a re-ranking unit 220 that generates a speech recognition result set with integrated score from the speech recognition result set with recognition score by using a word vector expression database, a cluster center vector expression database, and a normalized knowledge information word DF value database.

Type: Grant

Filed: December 19, 2017

Date of Patent: August 10, 2021

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Takashi Nakamura, Nobuaki Hiroshima, Setsuo Yamada
Spatial audio signaling filtering

Patent number: 11089405

Abstract: An apparatus comprising: an analyser configured to analyse at least one input to determine one or more expression within the at least one input; and a controller configured to control at least one audio signal associated with the at least one input dependent on the determination of the one or more expression.

Type: Grant

Filed: March 14, 2012

Date of Patent: August 10, 2021

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Roope Olavi Jarvinen, Kari Juhani Järvinen, Juha Henrik Arrasvuori, Miikka Vilermo
Systems and methods for voice-based initiation of custom device actions

Patent number: 11087752

Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from device action data set based on content of the audio signal. The data processing system can then identify, from the device action dataset, a command associated with the device action and send the command to the for execution device for execution.

Type: Grant

Filed: August 22, 2018

Date of Patent: August 10, 2021

Assignee: Google LLC

Inventors: Bo Wang, Subbaiah Venkata, Chad Yoshikawa, Chris Ramsdale, Pravir Gupta, Alfonso Gomez-Jordana, Kevin Yeun, Jae Won Seo, Lantian Zheng, Sang Soo Sung
Voice recognition method, apparatus, device and storage medium

Patent number: 11087763

Abstract: A voice recognition method is provided by embodiments of the present application. The method includes: obtaining a voice signal to be recognized; and recognizing a current frame in the voice signal using a pre-trained causal acoustic model, according to the current frame in the voice signal and a frame within a preset time period before the current frame, the causal acoustic model being derived based on a causal convolutional neural network training. In the method provided by the embodiments of the present application, only the information of the current frame and the frame before the current frame is used when performing the recognition of the current frame, thereby solving a problem in voice recognition technologies based on prior art convolutional neural network where a hard delay is created because there is a need to wait for the frames after the current frame, improving the timeliness of the voice recognition.

Type: Grant

Filed: December 28, 2018

Date of Patent: August 10, 2021

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Chao Li, Weixin Zhu, Ming Wen
System and method for dynamic speech recognition selection based on speech rate or business domain

Patent number: 11087766

Abstract: A dynamic speech processing system and method is provided. The system includes a receiver configured to receive a plurality of audio files. The audio files include sample training audio files and run-time audio files. The system further includes a speech processor coupled to the receiver and configured to compute a variable value for a specific audio file. The speech processor is configured to dynamically select a set of relevant speech recognition engines for a specific run-time audio file based on the variable value.

Type: Grant

Filed: March 8, 2018

Date of Patent: August 10, 2021

Assignee: Uniphore Software Systems

Inventors: Sachdev Umesh, Pattabhiraman Thiyagarajasarma, Gopalakrishnan Gururaghavendran
Voice actuated data retrieval and automated retrieved data display method

Patent number: 11086863

Abstract: Methodologies are provided for generating, organizing, storing and retrieving medical records using voice recognition in combination with unique codes assigned to data elements, and include microprocessor and memory, such as non-transient computer readable medium, having stored thereon a database including vocabulary terms. Methods include receiving spoken language via a speech recognition interface, and generating on a display an output according to vocabulary terms uniquely associated with the spoken language. Data stored in the database can include records organized into specific modules having specified vocabulary terms synced with each module and unique computer code to key vocabulary terms in the database. Using an associated unique code can cause specific data field to open on display when recognizing specific spoken word or phrase by the speech recognition interface.

Type: Grant

Filed: December 23, 2019

Date of Patent: August 10, 2021

Inventor: Jeffrey E. Koziol
Interaction method and apparatus

Patent number: 11081108

Abstract: Embodiments of the present disclosure disclose an interaction method and apparatus. A specific embodiment of the method includes: generating, in response to determining that a request input by a user satisfies a guiding condition, guiding information, and feeding back the guiding information to the user, the guiding condition including one of the following: associating with a plurality of query intents, or associating with no query intent; and generating, based on the request and a feedback input by the user corresponding to the guiding information, an intent-clear request, and feeding back push information bound with the intent-clear request to the user. Realizing that in the process of interacting with the user, for conditions such as the request input by the user is associated with a plurality of query intents or incompleteness, an intent-clear request associated with an explicit query intent is determined through the interaction with the user.

Type: Grant

Filed: June 28, 2019

Date of Patent: August 3, 2021

Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co. Ltd.

Inventors: Mengmeng Zhang, Zhongji Fan, Lei Shi, Li Wan, Qiang Ju, Chao Yin, Wei Shen, Jian Xie, Ran Xu, Jingya Wang
Contextual spoken language understanding in a spoken dialogue system

Patent number: 11081106

Abstract: A spoken dialogue system includes a spoken language understanding apparatus. The spoken language understanding apparatus can include an intent apparatus and a selection apparatus. The intent apparatus is configured to determine if a query comprises a global command, to determine if an intent associated with a query is or is not included in a domain that is supported by the spoken dialogue system, to determine if a query comprises a confirmation type, to tag one or more entities in a query, and to determine an intent probability distribution and a domain probability distribution that is associated with a query. When the query includes an entity that is included in two or more possible entities, the selection apparatus is configured to provide a score for each of the two or more possible entities.

Type: Grant

Filed: August 25, 2017

Date of Patent: August 3, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Xihui Lin, Andrew James McNamara, Jing He
Understanding user sentiment using implicit user feedback in adaptive dialog systems

Patent number: 11074913

Abstract: Various embodiments are provided for understanding user sentiment in a dialog system in a computing environment by a processor. A sentiment of a user may be detected according to a sentiment analysis and user feedback during a dialog with the user. One or more reasons for the sentiment of the user may be identified. Behavior of the dialog system may be adjusted according to the one or more reasons.

Type: Grant

Filed: January 3, 2019

Date of Patent: July 27, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Oznur Alkan, Adi I. Botea, Elizabeth Daly, Matthew Davis, Christian Muise
Speech recognition using convolutional neural networks

Patent number: 11069345

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.

Type: Grant

Filed: December 18, 2019

Date of Patent: July 20, 2021

Assignee: DeepMind Technologies Limited

Inventors: Aaron Gerard Antonius van den Oord, Sander Etienne Lea Dieleman, Nal Emmerich Kalchbrenner, Karen Simonyan, Oriol Vinyals, Lasse Espeholt
Apparatus, server, and method for providing conversation topic

Patent number: 11048750

Abstract: A conversation topic providing method includes: converting voice data, of a conversation of a user who is on a phone, into text; selecting a keyword, indicating an intention of the user, from the text; obtaining information of interest with respect to the keyword; and determining topics relating to the keyword based on user information.

Type: Grant

Filed: August 5, 2014

Date of Patent: June 29, 2021

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Hue-yin Kim, Sang-il Lee, Sung-kyu Lee, Seong-seol Hong, Jung-hoon Shin, Yeon-woo Lee
System and method for neural network orchestration

Patent number: 11043209

Abstract: Methods and systems for training one or more neural networks for transcription and for transcribing a media file using the trained one or more neural networks are provided. One of the methods includes: segmenting the media file into a plurality of segments; extracting, using a first neural network, audio features of a first and second segment of the plurality of segments; and identifying, using a second neural network, a best-candidate engine for each of the first and second segments based at least on audio features of the first and second segments. A best-candidate engine is a neural network having a highest predicted transcription accuracy among a collection of neural networks.

Type: Grant

Filed: January 8, 2019

Date of Patent: June 22, 2021

Inventors: Peter Nguyen, David Kettler, Karl Schwamb, Chad Steelberg
Speech signal processing and evaluation

Patent number: 11043212

Abstract: There is disclosed a system that, when in operation, evaluates speech, for example evaluates a speech signal generated using a microphone to record an oral utterance.

Type: Grant

Filed: November 29, 2018

Date of Patent: June 22, 2021

Inventor: Peter Bell
System and method for detection and correction of incorrectly pronounced words

Patent number: 11043213

Abstract: A system and method are disclosed for capturing a segment of speech audio, performing phoneme recognition on the segment of speech audio to produce a segmented phoneme sequence, comparing the segmented phoneme sequence to stored phoneme sequences that represent incorrect pronunciations of words to determine if there is a match, and identifying an incorrect pronunciation for a word in the segment of speech audio. The system builds a library based on the data collected for the incorrect pronunciations.

Type: Grant

Filed: December 7, 2018

Date of Patent: June 22, 2021

Assignee: SoundHound, Inc.

Inventors: Katayoun Norouzi, Karl Stahl
Electronic device, recognition method, and non-transitory computer-readable storage medium

Patent number: 11042705

Abstract: According to one embodiment, an electronic device comprises a memory that stores dictionary data, a voice input receiver, and a hardware processor. The dictionary data comprises first dictionary data and updatable second dictionary data. A number of voice commands in the first dictionary data is greater than a number of voice commands in the second dictionary data. The first dictionary data is divided into sub-dictionaries. The hardware processor recognizes the received voice using at least one of the sub-dictionaries or the second dictionary data.

Type: Grant

Filed: May 31, 2019

Date of Patent: June 22, 2021

Assignee: Dynabook Inc.

Inventor: Midori Nakamae
Rapid generation of equivalent terms for domain adaptation in a question-answering system

Patent number: 11036803

Abstract: An approach is provided that receives a question at a question-answering (QA) system. The received question includes one or more terms, and the question pertains to a subject matter domain that is supported by the QA system. Analyzing a number of expressions included in a set of question-answer pairs (QA pairs), with the QA pairs being ground-truths established to in support of the subject matter domain. The analysis identifies whether a selected term from the question is a synonym for any of the expressions. The expressions that are identified as synonyms are then used in a QA pipeline that generates one or more candidate answers to the received question.

Type: Grant

Filed: April 10, 2019

Date of Patent: June 15, 2021

Assignee: International Business Machines Corporation

Inventors: Stephen A. Boxwell, Keith G. Frost, Stanley J. Vernier, Kyle M. Brake
Continuous topic detection and adaption in audio environments

Patent number: 11031005

Abstract: A mechanism is described for facilitating continuous topic detection and adaption in audio environments, according to one embodiment. A method of embodiments, as described herein, includes detecting a term relating to a topic in an audio input received from one or more microphones of the computing device including a voice-enabled device; analyzing the term based on the topic to determine an action to be performed by the computing device; and triggering an event to facilitate the computing device to perform the action consistent with the term and the topic.

Type: Grant

Filed: December 17, 2018

Date of Patent: June 8, 2021

Assignee: INTEL CORPORATION

Inventors: Georg Stemmer, Andrzej Mialkowski, Joachim Hofer, Piotr Rozen, Tomasz Szmelczynski
Electronic device and control method thereof

Patent number: 11030996

Abstract: The present invention relates to an electronic device and a control method thereof. The electronic device comprises a microphone for acquiring sound; and a control unit for determining whether the acquired sound is a learned sound and outputting information on the acquired sound on the basis of a determination result.

Type: Grant

Filed: October 14, 2016

Date of Patent: June 8, 2021

Assignee: LG ELECTRONICS INC.

Inventors: Jiyoung Huh, Jongcheol Shin, Sunryang Kim

prev 1 2 3 4 5 6 7 8 … next