Recognition Patents (Class 704/231)
  • Patent number: 11210059
    Abstract: A method and system for modifying an audible command is provided. The method includes continuously receiving audible commands associated with a context of interactions between a user and individuals. The audible commands are analyzed with respect to associated actions and user attributes of the audible commands are identified. Specified information required for executing each command of the audible commands and portions of the specified information associated with specified individuals of the individuals are determined. Digital audio samples of the user are retrieved and assigned to the portions of the specified information with respect to each command. The associated actions are modified with respect to the specified individuals and self-learning software code comprising the modified actions is generated and executed such that the commands are executed with respect to the modified actions.
    Type: Grant
    Filed: June 25, 2019
    Date of Patent: December 28, 2021
    Assignee: International Business Machines Corporation
    Inventors: Craig M. Trim, Garfield W. Vaughn, Shubhadip Ray, Sarbajit K. Rakshit
  • Patent number: 11204929
    Abstract: Mechanisms, in a Question Answering (QA) system comprising a processor and a memory, for evaluating a hypothetical link in an ontology are provided. An initial analysis of the ontology is performed to identify a set of information concept entities and links between information concept entities in the ontology. The hypothetical link between a first information concept entity and a second information concept entity in the ontology is generated based on the initial analysis of the ontology. Natural language questions corresponding to the hypothetical link are processed to generate answer results directed to a plurality of links between a plurality of information concept entities. The answer results are aggregated across the plurality of links to determine an aggregate answer result for the hypothetical link. An indication of whether or not the hypothetical link is a valid link is output based on the aggregate answer result for the hypothetical link.
    Type: Grant
    Filed: November 18, 2014
    Date of Patent: December 21, 2021
    Assignee: International Business Machines Corporation
    Inventors: Darryl M. Adderly, Corville O. Allen, Robert K. Tucker
  • Patent number: 11200884
    Abstract: Techniques for labeling user inputs for updating user recognition voice profiles are described. A system may leverage various signals, generated during or after processing of a user input, to retroactively determine which user spoke the user input. For example, after the system receives the user input, the user may provide the system with non-spoken user verification information. Based on such user verification information, the system may label the previously spoken user input as originating from the particular user. The system may also or alternatively use system usage history to retroactively label user inputs.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: December 14, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Sundararajan Srinivasan, Arindam Mandal, Krishna Subramanian, Spyridon Matsoukas, Aparna Khare, Rohit Prasad
  • Patent number: 11195510
    Abstract: Systems, methods, and computer-readable storage media for providing for intelligent switching of languages and/or pronunciations in a text-to-speech system. As the system receives text, the text is analyzed to identify portions which should have speech constructed using a pronunciation distinct from the remaining portions of the text. The text-to-speech system uses multiple pronunciation dictionaries to generate and produce speech corresponding to the text, where the identified portions of the text are in a different language or have a different accent from the remainder of the text. Having generated speech corresponding to the text in multiple languages, accents, or dialects, the system combines the portions, then communicates the speech to the text recipient.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: December 7, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Gregory Pulz, Harry E. Blanchard, Lan Zhang
  • Patent number: 11188199
    Abstract: A website navigation system has an analysis system which receives a request for an Internet web page from a client device. The analysis system receives web page data associated with the web page from the Internet and performs a data analysis process to organize the web page data for use in a virtual conversation with the user in order to present the web page in an audible format. The analysis system identifies separate elements of the web page from the web page data and extracts information from the separate elements based on the web page data. The analysis system groups the separate elements into categories based on the extracted information and sorts the groups of separate elements based on usage statistics. The analysis system then generates a prompt for being output to the user by the client device as audible output based on the sorted groups of separate elements.
    Type: Grant
    Filed: April 16, 2018
    Date of Patent: November 30, 2021
    Assignee: International Business Machines Corporation
    Inventors: Florian Pinel, Donna K. Byron, Christian Ewen, Carmine Dimascio, Benjamin L. Johnson
  • Patent number: 11188290
    Abstract: Provided are an electronic device, a control method thereof, and a sound output control system of the electronic device, for example, a technique for controlling sound that is output from an electronic device located in the same space as a voice recognition device.
    Type: Grant
    Filed: October 17, 2019
    Date of Patent: November 30, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jung Su Kim, Ho Jin Eo
  • Patent number: 11190632
    Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: November 30, 2021
    Inventor: Iwao Fujisaki
  • Patent number: 11189276
    Abstract: A vehicle includes a communication device configured to communicate with a terminal capable of providing a communication function; a sensor configured to receive voice of a user; a storage configured to store a user pattern related to a call pattern of the user; and a controller configured to search for at least one name candidate corresponding to input voice when receiving the input voice, determine a threshold for a confidence score of the at least one name candidate based on the user pattern, and select a name corresponding to the input voice from among the at least one name candidate based on the determined threshold.
    Type: Grant
    Filed: February 1, 2019
    Date of Patent: November 30, 2021
    Assignees: Hyundai Motor Company, Kia Motors Corporation
    Inventor: Kyung Chul Lee
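The name-selection step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how the threshold is derived from the user pattern, so the rule below (lower the threshold for contacts the user calls more often) is an assumption, as are the function name `select_name` and the parameters `base_threshold` and `discount`.

```python
def select_name(candidates, call_counts, base_threshold=0.8, discount=0.2):
    """Pick the best name candidate whose confidence clears its threshold.

    candidates:  list of (name, confidence) pairs from the recognizer.
    call_counts: dict mapping name -> number of past calls (the "user pattern").
    The threshold rule is hypothetical: frequently called contacts get a
    lower threshold, proportional to their share of all past calls.
    """
    total = sum(call_counts.values())
    best = None
    for name, score in candidates:
        share = call_counts.get(name, 0) / total if total else 0.0
        threshold = base_threshold - discount * share
        if score >= threshold and (best is None or score > best[1]):
            best = (name, score)
    return best[0] if best else None


# Toy data: "Kim" is called far more often than "Kin".
candidates = [("Kim", 0.72), ("Kin", 0.75)]
history = {"Kim": 9, "Kin": 1}
print(select_name(candidates, history))  # Kim
```

With an empty call history every candidate faces the base threshold, so the same two candidates would both be rejected.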
  • Patent number: 11184469
    Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: November 23, 2021
    Inventor: Iwao Fujisaki
  • Patent number: 11184468
    Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: November 23, 2021
    Inventor: Iwao Fujisaki
  • Patent number: 11184470
    Abstract: The communication device comprising a communication implementer, a TV program implementer, and a multiple language implementer.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: November 23, 2021
    Inventor: Iwao Fujisaki
  • Patent number: 11176941
    Abstract: A method and system for interpreting data transfer from a voice recognition platform. The voice recognition platform data transfer may include a designator, device identification, a command or query, and a plurality of variables. The voice recognition string from the voice recognition platform may include the designator, the query, and at least one variable of the plurality of variables. Further, the method may include generating an instruction string from the recognition string. The instruction string may include the designator, the query and the at least one variable of the plurality of variables. The method may include removing extraneous symbols from the instruction string to generate a cleaned instruction string comprising the designator, a cleaned query, and a cleaned variable of the at least one variable of the plurality of variables. The method may include searching a context platform database for data corresponding to the cleaned query.
    Type: Grant
    Filed: October 28, 2019
    Date of Patent: November 16, 2021
    Assignee: Connected Living Technology, LLC
    Inventors: Sarah Hoit, Brian McWade, Josiah Strandberg
  • Patent number: 11175154
    Abstract: Natural language directions are received and a set of maneuver/context pairs are generated based upon the natural language directions. The set of maneuver/context pairs are provided to a routing engine to obtain route information based upon the set of maneuver/context pairs. The route information is provided to an output system for surfacing to a user.
    Type: Grant
    Filed: November 20, 2018
    Date of Patent: November 16, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Elizabeth P. Salowitz, David Grochocki, Jr., Jeff West
  • Patent number: 11170765
    Abstract: A method for improving a transcription may include identifying, in the transcription, reliable channel tokens of an utterance of a reliable channel and an unreliable channel token of an utterance of an unreliable channel, and generating, using a machine learning model, a vector embedding for the unreliable channel token and vector embeddings for the reliable channel tokens. The method may further include calculating vector distances between the vector embedding and the vector embeddings, and generating, for the unreliable channel token and using the vector distances, a score corresponding to a reliable channel token. The method may further include determining that the score is within a threshold score, and in response to determining that the score is within the threshold score, replacing, in the transcription, the unreliable channel token with the reliable channel token.
    Type: Grant
    Filed: January 24, 2020
    Date of Patent: November 9, 2021
    Assignee: Intuit Inc.
    Inventors: Oren Sar Shalom, Yair Horesh, Alexander Zhicharevich, Elik Sror, Adi Shalev, Yehezkel Shraga Resheff
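The token-replacement idea in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the embeddings are hand-written toy vectors rather than the output of a machine learning model, cosine distance stands in for the unspecified vector distance, the distance itself is used as the score, and the names `cosine_distance`, `repair_token`, and `max_distance` are hypothetical.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def repair_token(unreliable_vec, reliable_vecs, max_distance=0.2):
    """Return the closest reliable-channel token, or None if none is close enough.

    reliable_vecs: dict mapping reliable-channel token -> embedding vector.
    max_distance:  hypothetical threshold on the distance score.
    """
    best_token, best_dist = None, float("inf")
    for token, vec in reliable_vecs.items():
        dist = cosine_distance(unreliable_vec, vec)
        if dist < best_dist:
            best_token, best_dist = token, dist
    return best_token if best_dist <= max_distance else None


# Toy two-dimensional embeddings for tokens heard on the reliable channel.
reliable = {"invoice": [1.0, 0.0], "refund": [0.0, 1.0]}
print(repair_token([0.9, 0.1], reliable))  # invoice
print(repair_token([1.0, 1.0], reliable))  # None
```

A caller would replace the unreliable-channel token in the transcription only when `repair_token` returns a token, and leave it unchanged otherwise.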
  • Patent number: 11172082
    Abstract: An information processing apparatus includes circuitry that receives, via a communication network, a first user request input in voice to a terminal, and reflects the first user request in a type or setting of a job. When a second user request input in voice to the terminal after the first user request is received via the communication network during the reflection of the first user request, the circuitry displays, on a display, information of the type or setting of the job reflecting a previous user request preceding the second user request. The previous user request includes the first user request.
    Type: Grant
    Filed: September 26, 2019
    Date of Patent: November 9, 2021
    Assignee: RICOH COMPANY, LTD.
    Inventor: Shun Yoshimi
  • Patent number: 11163905
    Abstract: A system for data sharing between a first user and a second user includes a memory and a processor. The processor is configured to execute instructions stored in the memory to associate a unique identifier with a first profile of the first user, the first profile includes user data; obtain, from a second device of the second user, a sensed identifier; and, in response to the sensed identifier matching the unique identifier of the first user, execute instruction to send, to the first user, a first request to share first user data of the first user with the second user; and receive, from the first user, a response to the first request to share the first user data of the first user with the second user. The sensed identifier is captured by a sensor of the second device.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: November 2, 2021
    Assignee: Ginko LLC
    Inventors: Ronald J. Czajka, Sam B. Attisha
  • Patent number: 11152084
    Abstract: Techniques for coding a medical report include identifying an acronym or abbreviation in the medical report, and a plurality of phrases not explicitly included in the medical report that are possible expanded forms of the acronym or abbreviation in the medical report. From the plurality of phrases, a most likely expanded form of the acronym or abbreviation may be selected by applying to the medical report a statistical acronym/abbreviation expansion model trained on a corpus of medical reports. By applying to the medical report with the expanded acronym or abbreviation one or more statistical fact extraction models, a clinical fact may be extracted from the medical report based at least in part on the most likely expanded form of the acronym or abbreviation in the medical report, and a corresponding medical taxonomy code may be assigned to the extracted clinical fact from the medical report.
    Type: Grant
    Filed: February 16, 2016
    Date of Patent: October 19, 2021
    Assignee: Nuance Communications, Inc.
    Inventors: Ravi Kondadadi, Girija Yegnanarayanan, Brian William Delaney, John Ortega
  • Patent number: 11152009
    Abstract: In a voice controlled system, multiple applications are configured to respond to various commands. The voice controlled system includes client devices and servers. The correct application to receive a natural language command is identified based on how well the command matches functions of the application. A target application to receive the command may additionally be selected based on which application is most likely to receive a command. Likelihood of an application receiving a command may be determined by considering context. The command may be a voice input to a client device that is analyzed by speech recognition technology to determine word strings representing possible commands. Thus, the selection of a target application to receive the command may be based on word strings from the natural language input, a closeness of fit between the command and an application, and/or the likelihood an application is the target for the next incoming command.
    Type: Grant
    Filed: August 14, 2017
    Date of Patent: October 19, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Jeffrey Penrod Adams
  • Patent number: 11138377
    Abstract: At least two processing device-implemented company name recognition components, operating upon a body of text in a document, identify at least one company name occurrence in the body of text based at least in part on a company identifier list. The company name recognition techniques implemented by each of the at least two company name recognition components are different from each other. The at least one company name occurrence is used to update the company identifier list. The updated company identifier list is then used by the at least two company name recognition components to identify at least one additional name occurrence in the same body of text. This process of repeatedly identifying occurrences of company names in the body of text and updating the company identifier list is performed until such time that no further company name occurrences are identified in the body of text.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: October 5, 2021
    Assignee: Freedin Solutions Group, LLC
    Inventors: David A. Cook, Andrzej H. Jachowicz, Phillip Karl Jones
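The repeat-until-nothing-new loop in the abstract above can be sketched with two deliberately simple recognizers. Both recognizers are assumptions chosen for illustration (an exact list match, and a pattern that picks up a capitalized phrase joined by "and" to a known name); the abstract does not describe the actual techniques, and all names below are hypothetical.

```python
import re

# One or more capitalized words, e.g. "Bolt Works".
CAPITALIZED_PHRASE = r"((?:[A-Z][a-z]+)(?: [A-Z][a-z]+)*)"


def match_by_list(text, known):
    """Recognizer 1: names from the identifier list that occur in the text."""
    return {name for name in known if name in text}


def match_by_conjunction(text, known):
    """Recognizer 2: a capitalized phrase following '<known name> and '."""
    found = set()
    for name in known:
        pattern = re.escape(name) + " and " + CAPITALIZED_PHRASE
        found.update(m.group(1) for m in re.finditer(pattern, text))
    return found


def find_companies(text, seed_list):
    """Run both recognizers repeatedly until neither finds anything new."""
    known, found = set(seed_list), set()
    while True:
        new = (match_by_list(text, known) | match_by_conjunction(text, known)) - found
        if not new:
            return found
        found |= new
        known |= new  # the updated list feeds the next pass


text = "Acme and Bolt Works merged. Bolt Works and Dyno Labs partnered."
print(sorted(find_companies(text, {"Acme"})))
# ['Acme', 'Bolt Works', 'Dyno Labs']
```

In this toy text "Dyno Labs" is only reachable on the second pass, after "Bolt Works" has been added to the list, which is the point of feeding the updated list back into the recognizers.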
  • Patent number: 11138976
    Abstract: Devices, systems, and methods are provided for automatic media device input scrolling. The system may receive voice data associated with a first device. The system may determine, based on the voice data, an input of the first device. The system may determine an active input of the first device. The system may determine a number of inputs from the active input to the input. The system may send one or more instructions based on the number of inputs.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: October 5, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Samuel Thomas Bailey, Bernardo De Carvalho e Silva, Mirosla Nadj, Damjan Majstorovic
  • Patent number: 11138975
    Abstract: In one aspect, a playback device includes at least one microphone configured to detect a voice input and generate sound input data. The playback device detects a first command keyword in the detected sound and, in response, makes a first determination, via a first local natural language unit (NLU), whether the input sound data includes at least one keyword within a first predetermined library of keywords. The playback device receives an indication of a second determination made by a second NLU that the input sound data includes at least one keyword from a second predetermined library of keywords. The playback device compares the results of the first determination and the second determination and, based on the comparison, foregoes further processing of the input sound data.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: October 5, 2021
    Assignee: Sonos, Inc.
    Inventors: Nick D'Amato, Connor Kristopher Smith
  • Patent number: 11132499
    Abstract: An automated natural dialogue system provides a combination of structure and flexibility to allow for ease of annotation of dialogues as well as learning and expanding the capabilities of the dialogue system based on natural language interactions.
    Type: Grant
    Filed: August 28, 2018
    Date of Patent: September 28, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Percy Shuo Liang, David Leo Wright Hall, Jesse Daniel Eskes Rusak, Daniel Klein
  • Patent number: 11132992
    Abstract: Generally discussed herein are devices, systems, and methods for on-device detection of a wake word. A device can include a memory including model parameters that define a custom wake word detection model, the wake word detection model including a recurrent neural network transducer (RNNT) and a lookup table (LUT), the LUT indicating a hidden vector to be provided in response to a phoneme of a user-specified wake word, a microphone to capture audio, and processing circuitry to receive the audio from the microphone, determine, using the wake word detection model, whether the audio includes an utterance of the user-specified wake word, and wake up a personal assistant after determining the audio includes the utterance of the user-specified wake word.
    Type: Grant
    Filed: July 25, 2019
    Date of Patent: September 28, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Emilian Stoimenov, Rui Zhao, Kaustubh Prakash Kalgaonkar, Ivaylo Andreanov Enchev, Khuram Shahid, Anthony Phillip Stark, Guoli Ye, Mahadevan Srinivasan, Yifan Gong, Hosam Adel Khalil
  • Patent number: 11132999
    Abstract: An information processing device according to the present application includes an extraction unit and a subsequent stage generation unit. The extraction unit extracts a last conversation of a feedback utterance estimated to indicate a predetermined reaction of a second utterance subject relative to an utterance made by a first utterance subject, from a set of a plurality of conversations, based on a score assigned to the feedback utterance. The subsequent stage generation unit generates a subsequent stage classifier for deriving an index indicating a category of an unknown conversation, based on the last conversation extracted by the extraction unit.
    Type: Grant
    Filed: September 4, 2018
    Date of Patent: September 28, 2021
    Assignee: YAHOO JAPAN CORPORATION
    Inventors: Chikara Hashimoto, Manabu Sassano
  • Patent number: 11125907
    Abstract: The present disclosure provides systems and methods for improved occupancy sensing. The methods and systems can deploy various signal threshold adjustments and/or signal analysis algorithms in response to sensed signals having a given quality, such as exceeding a threshold. In some cases, signal thresholds are lowered following an initial generated signal exceeding a first, higher threshold. In some cases, time-dependent signals are monitored using algorithms that analyze the signals for variations that are characteristic of human usage. Methods are disclosed for determining if two motion sensors are observing the same or overlapping spaces. Systems and methods for calibrating motion sensing systems are also disclosed.
    Type: Grant
    Filed: May 17, 2019
    Date of Patent: September 21, 2021
    Assignee: Steelcase Inc.
    Inventors: Michael Bloem, Marcus Ward, Mychal Hall
  • Patent number: 11120788
    Abstract: Provided is a system and method for acquiring training data and building an organizational-based language model based on the training data. In one example, the method may include collecting organizational data that is generated via one or more applications associated with an organization, aggregating the collected organizational data with previously collected organizational data to generate aggregated organizational training data, training an organizational-based language model for speech processing based on the aggregated organizational training data, and storing the trained organizational-based language model.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: September 14, 2021
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ziad Al Bawab, Anand U Desai, Cem Aksoylar, Michael Levit, Xin Meng, Shuangyu Chang, Suyash Choudhury, Dhiresh Rawal, Tao Li, Rishi Girish, Marcus Jager, Ananth Rampura Sheshagiri Rao
  • Patent number: 11120793
    Abstract: Depicted is a method of speech recognition, sequentially executed by a processor on consecutive speech segments, that comprises: obtaining digital information, which is a spectrogram representation, of a speech segment, and extracting from the spectrogram representation speech features that characterize the segment. Then, a consistent structure segment vector based on the speech features is determined, onto which machine learning is deployed to determine at least one label of the segment vector. A method of voice recognition and image recognition sequentially executed by a processor on consecutive voice segments is also described. A system for executing speech, voice, and image recognition is also provided that comprises client devices to obtain and display information, a segment vector generator to determine a consistent structure segment vector based on features, and a machine learning server to determine at least one label of the segment vector.
    Type: Grant
    Filed: June 11, 2017
    Date of Patent: September 14, 2021
    Assignee: VoicEncode Ltd.
    Inventor: Omry Netzer
  • Patent number: 11114101
    Abstract: A method of speech recognition and person identification based thereon, comprising: recording speech from a speech signal using a microphone; illuminating a speaking mouth; recording a degree of light reflected by the mouth from a reflection signal using a sensor; and recording combined parameters of the speech signal and of the reflection signal, and coupling them to letters associated therewith, per predetermined time duration; comparing a combination occurring in speech of parameters of the speech signal and of the reflection signal to the recorded combined parameters of the speech signal and of the reflection signal which are coupled to letters; and deciding on the basis of the comparison to which letter the combination occurring in the speech of parameters of the speech signal and of the reflection signal corresponds, using block-width modulation of the reflection signal.
    Type: Grant
    Filed: January 25, 2019
    Date of Patent: September 7, 2021
    Assignee: IEBM B.V.
    Inventors: Olaf Petrus Quirinus Mossinkoff, Johannes Leonardus Jozef Meijer
  • Patent number: 11102568
    Abstract: An automatic speech recognition (ASR) triggering system, and a method of providing an ASR trigger signal, is described. The ASR triggering system can include a microphone to generate an acoustic signal representing an acoustic vibration and an accelerometer worn in an ear canal of a user to generate a non-acoustic signal representing a bone conduction vibration. A processor of the ASR triggering system can receive an acoustic trigger signal based on the acoustic signal and a non-acoustic trigger signal based on the non-acoustic signal, and combine the trigger signals to gate an ASR trigger signal. For example, the ASR trigger signal may be provided to an ASR server only when the trigger signals are simultaneously asserted. Other embodiments are also described and claimed.
    Type: Grant
    Filed: April 26, 2019
    Date of Patent: August 24, 2021
    Assignee: APPLE INC.
    Inventors: Sorin V. Dusan, Aram M. Lindahl, Robert D. Watson
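The gating behavior given as an example in the abstract above (the ASR trigger is asserted only when both trigger signals are asserted at the same time) reduces to a per-frame logical AND. The sketch below assumes each trigger signal is already available as a list of per-frame booleans; the function and variable names are hypothetical.

```python
def gate_asr_trigger(acoustic_trigger, bone_trigger):
    """Assert the ASR trigger only in frames where both inputs are asserted."""
    return [a and b for a, b in zip(acoustic_trigger, bone_trigger)]


# Per-frame trigger flags from the microphone and from the in-ear accelerometer.
acoustic = [False, True, True, False]
bone_conduction = [False, False, True, True]
print(gate_asr_trigger(acoustic, bone_conduction))
# [False, False, True, False]
```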
  • Patent number: 11100935
    Abstract: Embodiments of the present disclosure relate to a voice assistant device and a method for controlling the voice assistant device. The voice assistant device comprises a receiver configured to receive at least one voice input from at least one user when operated in wake-up mode. An intent associated with the at least one voice input from the at least one user is identified. Further, a probability of issuance of a subsequent voice input from the at least one user is determined based on at least one of the intent, historic data and one or more contextual factors. An extended wake-up duration of the voice assistant device is estimated when the probability is greater than a predefined threshold value. Further, the duration of the wake-up mode is extended for the extended wake-up duration to receive the subsequent voice input from the at least one user.
    Type: Grant
    Filed: June 5, 2019
    Date of Patent: August 24, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Vijaya Kumar Tukka, Chethan Konanakere Puttanna, Deepraj Prabhakar Patkar, Sulochan Naik, Harish Bishnoi
  • Patent number: 11099812
    Abstract: Provided is a device including a display, an audio inputter, and a controller. The display displays at least one screen page of an application that is being executed. The audio inputter receives a voice command of a user. The controller performs an operation corresponding to the voice command by using screen page transition information for transition between application screen pages corresponding to the voice command, which is obtained from information about user interface (UI) elements included in the application screen pages of the application. Each of the UI elements performs a predetermined function when selected by the user.
    Type: Grant
    Filed: August 4, 2020
    Date of Patent: August 24, 2021
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Han-min Bang, Hyok-sung Choi
  • Patent number: 11096073
    Abstract: Techniques are disclosed for determining a performance criterion for a client device. A performance criterion for a client device may be determined based on a rate of mobility of a client device. Additionally or alternatively, a performance criterion for a client device associated with a particular attribute may be determined based on performance levels of a set of client devices associated with the same particular attribute. The performance criterion is used to evaluate a performance level of a client device. If the performance criterion is not satisfied, then a wireless configuration is modified to improve the performance level.
    Type: Grant
    Filed: February 15, 2016
    Date of Patent: August 17, 2021
    Assignee: Facebook, Inc.
    Inventor: Subbu Ponnuswamy
  • Patent number: 11087745
    Abstract: To provide a speech recognition results re-ranking technology for re-ranking speech recognition results so as to render speech recognition results suitable for intended use of speech recognition while reducing preparation costs required prior to execution of re-ranking processing of speech recognition results. A speech recognition results re-ranking device includes: a speech recognition unit 210 that generates a speech recognition result set with recognition score from speech data; and a re-ranking unit 220 that generates a speech recognition result set with integrated score from the speech recognition result set with recognition score by using a word vector expression database, a cluster center vector expression database, and a normalized knowledge information word DF value database.
    Type: Grant
    Filed: December 19, 2017
    Date of Patent: August 10, 2021
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takashi Nakamura, Nobuaki Hiroshima, Setsuo Yamada
  • Patent number: 11089405
    Abstract: An apparatus comprising: an analyser configured to analyse at least one input to determine one or more expression within the at least one input; and a controller configured to control at least one audio signal associated with the at least one input dependent on the determination of the one or more expression.
    Type: Grant
    Filed: March 14, 2012
    Date of Patent: August 10, 2021
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Roope Olavi Jarvinen, Kari Juhani Järvinen, Juha Henrik Arrasvuori, Miikka Vilermo
  • Patent number: 11087752
    Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from the device action data set based on content of the audio signal. The data processing system can then identify, from the device action data set, a command associated with the device action and send the command to the electronic device for execution.
    Type: Grant
    Filed: August 22, 2018
    Date of Patent: August 10, 2021
    Assignee: Google LLC
    Inventors: Bo Wang, Subbaiah Venkata, Chad Yoshikawa, Chris Ramsdale, Pravir Gupta, Alfonso Gomez-Jordana, Kevin Yeun, Jae Won Seo, Lantian Zheng, Sang Soo Sung
  • Patent number: 11087763
    Abstract: A voice recognition method is provided by embodiments of the present application. The method includes: obtaining a voice signal to be recognized; and recognizing a current frame in the voice signal using a pre-trained causal acoustic model, according to the current frame in the voice signal and a frame within a preset time period before the current frame, the causal acoustic model being derived based on a causal convolutional neural network training. In the method provided by the embodiments of the present application, only the information of the current frame and the frame before the current frame is used when performing the recognition of the current frame, thereby solving a problem in voice recognition technologies based on prior art convolutional neural network where a hard delay is created because there is a need to wait for the frames after the current frame, improving the timeliness of the voice recognition.
    Type: Grant
    Filed: December 28, 2018
    Date of Patent: August 10, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chao Li, Weixin Zhu, Ming Wen
  • Patent number: 11087766
    Abstract: A dynamic speech processing system and method is provided. The system includes a receiver configured to receive a plurality of audio files. The audio files include sample training audio files and run-time audio files. The system further includes a speech processor coupled to the receiver and configured to compute a variable value for a specific audio file. The speech processor is configured to dynamically select a set of relevant speech recognition engines for a specific run-time audio file based on the variable value.
    Type: Grant
    Filed: March 8, 2018
    Date of Patent: August 10, 2021
    Assignee: Uniphore Software Systems
    Inventors: Sachdev Umesh, Pattabhiraman Thiyagarajasarma, Gopalakrishnan Gururaghavendran
  • Patent number: 11086863
    Abstract: Methodologies are provided for generating, organizing, storing and retrieving medical records using voice recognition in combination with unique codes assigned to data elements, and include a microprocessor and memory, such as a non-transient computer readable medium, having stored thereon a database including vocabulary terms. Methods include receiving spoken language via a speech recognition interface, and generating on a display an output according to vocabulary terms uniquely associated with the spoken language. Data stored in the database can include records organized into specific modules having specified vocabulary terms synced with each module and unique computer code to key vocabulary terms in the database. Using an associated unique code can cause a specific data field to open on the display when a specific spoken word or phrase is recognized by the speech recognition interface.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: August 10, 2021
    Inventor: Jeffrey E. Koziol
  • Patent number: 11081108
    Abstract: Embodiments of the present disclosure disclose an interaction method and apparatus. A specific embodiment of the method includes: generating, in response to determining that a request input by a user satisfies a guiding condition, guiding information, and feeding back the guiding information to the user, the guiding condition including one of the following: associating with a plurality of query intents, or associating with no query intent; and generating, based on the request and a feedback input by the user corresponding to the guiding information, an intent-clear request, and feeding back push information bound with the intent-clear request to the user. Thus, when a request input by the user is associated with a plurality of query intents or is incomplete, an intent-clear request associated with an explicit query intent is determined through interaction with the user.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: August 3, 2021
    Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co. Ltd.
    Inventors: Mengmeng Zhang, Zhongji Fan, Lei Shi, Li Wan, Qiang Ju, Chao Yin, Wei Shen, Jian Xie, Ran Xu, Jingya Wang
  • Patent number: 11081106
    Abstract: A spoken dialogue system includes a spoken language understanding apparatus. The spoken language understanding apparatus can include an intent apparatus and a selection apparatus. The intent apparatus is configured to determine if a query comprises a global command, to determine if an intent associated with a query is or is not included in a domain that is supported by the spoken dialogue system, to determine if a query comprises a confirmation type, to tag one or more entities in a query, and to determine an intent probability distribution and a domain probability distribution that is associated with a query. When the query includes an entity that is included in two or more possible entities, the selection apparatus is configured to provide a score for each of the two or more possible entities.
    Type: Grant
    Filed: August 25, 2017
    Date of Patent: August 3, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xihui Lin, Andrew James McNamara, Jing He
  • Patent number: 11074913
    Abstract: Various embodiments are provided for understanding user sentiment in a dialog system in a computing environment by a processor. A sentiment of a user may be detected according to a sentiment analysis and user feedback during a dialog with the user. One or more reasons for the sentiment of the user may be identified. Behavior of the dialog system may be adjusted according to the one or more reasons.
    Type: Grant
    Filed: January 3, 2019
    Date of Patent: July 27, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oznur Alkan, Adi I. Botea, Elizabeth Daly, Matthew Davis, Christian Muise
  • Patent number: 11069345
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: July 20, 2021
    Assignee: DeepMind Technologies Limited
    Inventors: Aaron Gerard Antonius van den Oord, Sander Etienne Lea Dieleman, Nal Emmerich Kalchbrenner, Karen Simonyan, Oriol Vinyals, Lasse Espeholt
  • Patent number: 11048750
    Abstract: A conversation topic providing method includes: converting voice data, of a conversation of a user who is on a phone, into text; selecting a keyword, indicating an intention of the user, from the text; obtaining information of interest with respect to the keyword; and determining topics relating to the keyword based on user information.
    Type: Grant
    Filed: August 5, 2014
    Date of Patent: June 29, 2021
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Hue-yin Kim, Sang-il Lee, Sung-kyu Lee, Seong-seol Hong, Jung-hoon Shin, Yeon-woo Lee
  • Patent number: 11043209
    Abstract: Methods and systems for training one or more neural networks for transcription and for transcribing a media file using the trained one or more neural networks are provided. One of the methods includes: segmenting the media file into a plurality of segments; extracting, using a first neural network, audio features of a first and second segment of the plurality of segments; and identifying, using a second neural network, a best-candidate engine for each of the first and second segments based at least on audio features of the first and second segments. A best-candidate engine is a neural network having a highest predicted transcription accuracy among a collection of neural networks.
    Type: Grant
    Filed: January 8, 2019
    Date of Patent: June 22, 2021
    Inventors: Peter Nguyen, David Kettler, Karl Schwamb, Chad Steelberg
  • Patent number: 11043212
    Abstract: There is disclosed a system that, when in operation, evaluates speech, for example evaluates a speech signal generated using a microphone to record an oral utterance.
    Type: Grant
    Filed: November 29, 2018
    Date of Patent: June 22, 2021
    Inventor: Peter Bell
  • Patent number: 11043213
    Abstract: A system and method are disclosed for capturing a segment of speech audio, performing phoneme recognition on the segment of speech audio to produce a segmented phoneme sequence, comparing the segmented phoneme sequence to stored phoneme sequences that represent incorrect pronunciations of words to determine if there is a match, and identifying an incorrect pronunciation for a word in the segment of speech audio. The system builds a library based on the data collected for the incorrect pronunciations.
    Type: Grant
    Filed: December 7, 2018
    Date of Patent: June 22, 2021
    Assignee: SoundHound, Inc.
    Inventors: Katayoun Norouzi, Karl Stahl
  • Patent number: 11042705
    Abstract: According to one embodiment, an electronic device comprises a memory that stores dictionary data, a voice input receiver, and a hardware processor. The dictionary data comprises first dictionary data and updatable second dictionary data. A number of voice commands in the first dictionary data is greater than a number of voice commands in the second dictionary data. The first dictionary data is divided into sub-dictionaries. The hardware processor recognizes the received voice using at least one of the sub-dictionaries or the second dictionary data.
    Type: Grant
    Filed: May 31, 2019
    Date of Patent: June 22, 2021
    Assignee: Dynabook Inc.
    Inventor: Midori Nakamae
  • Patent number: 11036803
    Abstract: An approach is provided that receives a question at a question-answering (QA) system. The received question includes one or more terms, and the question pertains to a subject matter domain that is supported by the QA system. The approach analyzes a number of expressions included in a set of question-answer pairs (QA pairs), with the QA pairs being ground-truths established in support of the subject matter domain. The analysis identifies whether a selected term from the question is a synonym for any of the expressions. The expressions that are identified as synonyms are then used in a QA pipeline that generates one or more candidate answers to the received question.
    Type: Grant
    Filed: April 10, 2019
    Date of Patent: June 15, 2021
    Assignee: International Business Machines Corporation
    Inventors: Stephen A. Boxwell, Keith G. Frost, Stanley J. Vernier, Kyle M. Brake
  • Patent number: 11031005
    Abstract: A mechanism is described for facilitating continuous topic detection and adaption in audio environments, according to one embodiment. A method of embodiments, as described herein, includes detecting a term relating to a topic in an audio input received from one or more microphones of the computing device including a voice-enabled device; analyzing the term based on the topic to determine an action to be performed by the computing device; and triggering an event to facilitate the computing device to perform the action consistent with the term and the topic.
    Type: Grant
    Filed: December 17, 2018
    Date of Patent: June 8, 2021
    Assignee: INTEL CORPORATION
    Inventors: Georg Stemmer, Andrzej Mialkowski, Joachim Hofer, Piotr Rozen, Tomasz Szmelczynski
  • Patent number: 11030996
    Abstract: The present invention relates to an electronic device and a control method thereof. The electronic device comprises a microphone for acquiring sound; and a control unit for determining whether the acquired sound is a learned sound and outputting information on the acquired sound on the basis of a determination result.
    Type: Grant
    Filed: October 14, 2016
    Date of Patent: June 8, 2021
    Assignee: LG ELECTRONICS INC.
    Inventors: Jiyoung Huh, Jongcheol Shin, Sunryang Kim