Patents Examined by Qi Han
  • Patent number: 11430464
    Abstract: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.
    Type: Grant
    Filed: December 3, 2018
    Date of Patent: August 30, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
  • Patent number: 11423898
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example method includes receiving, from one or more external electronic devices, a plurality of speaker profiles for a plurality of users; receiving a natural language speech input; determining, based on comparing the natural language speech input to the plurality of speaker profiles: a first likelihood that the natural language speech input corresponds to a first user of the plurality of users; and a second likelihood that the natural language speech input corresponds to a second user of the plurality of users; determining whether the first likelihood and the second likelihood are within a first threshold; and in accordance with determining that the first likelihood and the second likelihood are not within the first threshold: providing a response to the natural language speech input, the response being personalized for the first user.
    Type: Grant
    Filed: March 11, 2020
    Date of Patent: August 23, 2022
    Assignee: Apple Inc.
    Inventors: Stephen H. Shum, Corey J. Peterson, Sachin S. Kajarekar, Benjamin S. Phipps, Erik Marchi, Jessica Peck, Anumita Biswas, Chaitanya Mannemala
  • Patent number: 11412325
    Abstract: A method of providing audio information from a meeting includes receiving a first audio stream from a first input audio device and a second audio stream from a second input audio device during the meeting, identifying a first audio fragment from the first audio stream, and identifying a second audio fragment from the second audio stream. The method also includes compiling the audio fragments from the first and second audio streams into an audio file that includes at least the first audio fragment and the second audio fragment. The method further includes providing the audio file to one or more recipients. The audio file identifies the first audio fragment as corresponding to a first participant of the meeting and the second audio fragment as corresponding to a second participant of the meeting.
    Type: Grant
    Filed: June 17, 2020
    Date of Patent: August 9, 2022
    Assignee: EVERNOTE CORPORATION
    Inventors: Andrew Sinkov, Alexander Pashintsev
  • Patent number: 11404059
    Abstract: Systems and methods for screenless computerized social-media access may include (1) receiving, from a user device, data describing an audible user response to a segment of an audiobook that was transmitted to the user device from an audiobook service, (2) creating a digital response-indicator indicative of the audible user response, and (3) providing the digital response-indicator to an additional user device. Various other methods, systems, and computer-readable media are also disclosed.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: August 2, 2022
    Assignee: Meta Platforms, Inc.
    Inventor: Debashish Paul
  • Patent number: 11404073
    Abstract: A system configured to improve double-talk detection. The system inputs microphone signals into an adaptive filter and determines whether double talk is present based on how the adaptive filter adapts to the microphone signals. For example, when an audible sound is detected, the adaptive filter updates the filter coefficients that correspond to a time difference of arrival of the audible sound. Thus, the device may detect single-talk conditions (e.g., a single peak in the filter coefficients) or double-talk conditions (e.g., two peaks in the filter coefficients). In addition, the device may track a location of the local speech or remote speech over time based on the time difference of arrival.
    Type: Grant
    Filed: December 13, 2018
    Date of Patent: August 2, 2022
    Assignee: Amazon Technologies, Inc.
    Inventor: Xianxian Zhang
  • Patent number: 11404052
    Abstract: In a service data processing method performed by a server, user speech information collected by a first terminal is received. A target service operation code according to the user speech information is obtained. The target service operation code is used for identifying target service operation information. The target service operation code is transmitted from the server to the first terminal, so that the first terminal plays the target service operation code by using a speech. The target service operation code obtained by a second terminal is received. A target execution page corresponding to the target service operation code is searched for. The target execution page is transmitted to the second terminal, so that the second terminal executes a service operation corresponding to the target.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: August 2, 2022
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Jinglin Ma, Xuewei Fang
  • Patent number: 11393467
    Abstract: An electronic device includes a voice receiving unit configured to receive a voice input, a first communication unit configured to communicate with an external device having a voice recognition function, and a control unit. The control unit receives a notification indicating whether the external device is ready to recognize the voice input, via the first communication unit. In a case where the notification indicates that the external device is not ready to recognize the voice input, the control unit controls the external device to be ready to recognize the voice input via the first communication unit when a predetermined voice input including a phrase corresponding to the external device is received through the voice receiving unit.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: July 19, 2022
    Assignee: Canon Kabushiki Kaisha
    Inventor: Shunji Fujita
  • Patent number: 11373544
    Abstract: A method includes displaying a first set of text content characterized by a first difficulty level. The method includes obtaining speech data associated with the first set of text content. The method includes determining linguistic feature(s) within the speech data. The method includes in response to completion of the speech data, determining a reading proficiency value associated with the first set of text content and based on the linguistic feature(s). The method includes in accordance with determining the reading proficiency value satisfies change criteria, changing a difficulty level for a second set of text content. After changing the difficulty level, the second set of text content corresponds to a second difficulty level different from the first difficulty level. The method includes in accordance with determining the reading proficiency value does not satisfy the change criteria, maintaining the second set of text content at the first difficulty level.
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: June 28, 2022
    Assignee: APPLE INC.
    Inventors: Barry-John Theobald, Russell Y. Webb, Nicholas Elia Apostoloff
  • Patent number: 11361768
    Abstract: A method includes receiving a spoken utterance that includes a plurality of words, and generating, using a neural network-based utterance classifier comprising a stack of multiple Long-Short Term Memory (LSTM) layers, a respective textual representation for each word of the plurality of words of the spoken utterance. The neural network-based utterance classifier is trained on negative training examples of spoken utterances not directed toward an automated assistant server. The method further includes determining, using the respective textual representation generated for each word of the plurality of words of the spoken utterance, that the spoken utterance is one of directed toward the automated assistant server or not directed toward the automated assistant server, and when the spoken utterance is directed toward the automated assistant server, generating instructions that cause the automated assistant server to generate a response to the spoken utterance.
    Type: Grant
    Filed: July 21, 2020
    Date of Patent: June 14, 2022
    Assignee: Google LLC
    Inventors: Nathan David Howard, Gabor Simko, Maria Carolina Parada San Martin, Ramkarthik Kalyanasundaram, Guru Prakash Arumugam, Srinivas Vasudevan
  • Patent number: 11361756
    Abstract: In one aspect, a playback device includes at least one microphone configured to detect sound. The playback device detects sound via the at least one microphone and determines whether (i) the detected sound includes a voice input, (ii) the detected sound excludes background speech, and (iii) the voice input includes a command keyword. In response to the determining, the playback device performs a playback function corresponding to the command keyword.
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: June 14, 2022
    Assignee: Sonos, Inc.
    Inventors: Connor Smith, John Tolomei, Kurt Soto
  • Patent number: 11361783
    Abstract: The present invention provides a computer-aided conversion test system and method for generating intelligible speech. The test system includes an acoustic test module with a nasal-genio-oropharyngeal tract, a transmitting module that generates a detecting signal, a first receiving module, a second receiving module, and a central processing module with a plurality of first phonetically oral cavity shape spectra. By adjusting the transmitting module, the first receiving module, or the second receiving module, a second phonetically oral cavity shape spectrum is correctly compared and identified by a central computing unit as one of the corresponding first phonetically oral cavity shape spectra. After testing, training and adjusting through the test method, the detecting signal transmitted by the transmitting module is analyzed and identified by the central processing module to increase its interpretation accuracy and shorten the time of machine learning.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: June 14, 2022
    Assignee: TS VOICE TECHNOLOGY, LLC
    Inventors: Shu-Wei Tsai, Heng-chin Yeh, Yi-Hsin Chen
  • Patent number: 11354516
    Abstract: An information processor includes a generation section that generates a specified character string on the basis of at least one of voice information corresponding to a content of speech detected by a voice detection section and vehicle information acquired from a vehicle. With this configuration, a user can input the specified character string, which is a hashtag, without an operation. Thus, compared to the related art in which the hashtag is generated on the basis of the operation (manual input) by the user, a burden on the user can significantly be reduced, and an input error can be prevented.
    Type: Grant
    Filed: November 22, 2019
    Date of Patent: June 7, 2022
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventors: Ryotaro Fujiwara, Keiko Suzuki, Makoto Honda, Chikage Kubo, Ryota Okubi, Takeshi Fujiki
  • Patent number: 11341967
    Abstract: A method for a voice interaction is provided according to embodiments of the disclosure, the method belonging to the field of smart devices. The method may include: receiving an external input; checking a current time in response to the external input; calling a voice program; and raising a question according to the current time and the called voice program, and playing the called voice program. The method and apparatus for a voice interaction may perform more immersive interaction with a user, thereby improving the user experience.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: May 24, 2022
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Dongli Liu, Xiaocheng Dai, Jian Peng
  • Patent number: 11335323
    Abstract: A method is provided for communicating a non-speech message as audio from a first device to a second device such that information can be passed between the first and second device. The method includes: encoding the non-speech message as a dissimilar speech message having a plurality of phonemes; transmitting the speech message over one or more audio communications channels from the first device; receiving the speech message at the second device; recognizing the speech message; and decoding the dissimilar speech message to the non-speech message. By using existing audio functionality, and the increasingly more reliable voice recognition applications, an improved method is provided for sharing complex data messages using commonly available communication channels.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: May 17, 2022
    Assignee: MASTERCARD INTERNATIONAL INCORPORATED
    Inventor: Robert Collins
  • Patent number: 11335356
    Abstract: A local extremum calculator detects a local maximum sample and a local minimum sample of a digital audio signal. A number-of-sample detector detects a sample interval between the local maximum sample and the local minimum sample. A difference value calculator calculates difference values between adjacent samples. A correction value calculator calculates a first correction value by multiplying the difference value between the local maximum sample and a first adjacent sample by a coefficient and calculates a second correction value by multiplying the difference value between the local minimum sample and a second adjacent sample by the coefficient. When a periodic signal detector detects that the digital audio signal is a single sine wave, an adder/subtractor does not add the first correction value to the first adjacent sample, and does not subtract the second correction value from the second adjacent sample.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: May 17, 2022
    Assignee: JVCKENWOOD CORPORATION
    Inventor: Sadahiro Yasura
  • Patent number: 11335339
    Abstract: A voice interaction method and apparatus, a terminal, a server, and a readable storage medium are provided. The method includes the following steps: obtaining a user's demand according to the user's voice; determining a pre-stored task template matched with the user's demand; matching the user's demand with a necessary slot in the matched task template; and if the user's demand lacks content of the necessary slot, executing a step of obtaining the content of the necessary slot, to obtain the content of the necessary slot; wherein the task template is a template generated in advance according to information required for activating a task operation through voice, the slot is information in the task template, and the necessary slot is necessary information in the task template for activating the task operation.
    Type: Grant
    Filed: August 1, 2018
    Date of Patent: May 17, 2022
    Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co., Ltd.
    Inventor: Tian Wang
  • Patent number: 11335327
    Abstract: Disclosed herein are system, method, and computer program product embodiments for a text-to-speech system. An embodiment operates by identifying a document including text, wherein the text includes both a structured portion of text, and an unstructured portion of text. Both the structured portion and unstructured portions of the text are identified within the document rich data, wherein the structured portion corresponds to a rich data portion that includes both a descriptor and content, and wherein an unstructured portion of the text includes alphanumeric text. A request to audibly output the document including the rich data portion is received from a user profile. A summary of the rich data portion is generated at a level of detail corresponding to the user profile. The audible version of the document including both the alphanumeric text of the unstructured portion of the document and the generated summary is audibly output.
    Type: Grant
    Filed: June 24, 2020
    Date of Patent: May 17, 2022
    Assignee: CAPITAL ONE SERVICES, LLC
    Inventors: Galen Rafferty, Reza Farivar, Anh Truong, Jeremy Goodsitt, Vincent Pham, Austin Walters
  • Patent number: 11322144
    Abstract: Disclosed are an information providing device and an information providing method, which provide information enabling a conversation with a user by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm in a 5G environment connected for Internet-of-Things. An information providing method according to one embodiment of the present disclosure includes gathering first situational information from a home monitoring device, gathering, from the first electronic device, second situational information corresponding to the first situational information, gathering, from the home monitoring device, third situational information containing a behavioral change of the user after gathering the first situational information, generating a spoken sentence to provide to the user on the basis of the first situational information to the third situational information, and converting the spoken sentence to spoken utterance information to be output to the user.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: May 3, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Ji Chan Maeng, Jong Hoon Chae
  • Patent number: 11315562
    Abstract: The main object of the present application is to provide a method and device for information interaction by quantitatively inputting voice indicators and processing and interacting information, the method comprising: displaying and persistently flashing a first prompt word and starting to receive a first input voice of a user; comparing the first input voice with the first prompt word; displaying and persistently flashing a second prompt word and starting to receive a second input voice of the user; comparing the second input voice with the second prompt word; and if the second input voice is matched with the second prompt word, then integrating the first input voice and the second input voice to be a digital voice file, and storing the digital voice file. The present application can help the user correctly, quickly and simply record sound, and reduce interference factors to the least, and can accurately, completely and conveniently acquire user sound, thus facilitating subsequent analysis and recognition.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: April 26, 2022
    Inventor: Zhonghua Ci
  • Patent number: 11308954
    Abstract: Provided are a method of associating an AI device with a device based on a behavior pattern of a user and a device therefor. The method of associating the AI device with the device according to an embodiment of the invention receives a preset behavior pattern of the user sensed by a first camera from the first camera, receives a voice command for controlling an operation of the device from the user, and transmits the voice command to the device, thus allowing devices having no AI function to be used in conjunction with the AI device. The AI device and the device of the invention may be associated with artificial intelligence modules, drones (unmanned aerial vehicles (UAVs)), robots, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G service, etc.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: April 19, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Jichan Maeng
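
Illustrative note on Patent number 11335339 (listed above): its abstract describes matching a user's demand against the necessary slots of a pre-stored task template and obtaining the content of any necessary slot that is missing. The sketch below shows one possible reading of that idea and is not the patented implementation. The names `TaskTemplate`, `missing_slots`, `complete_demand`, the `ask` callback, and the `book_taxi` example are all hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class TaskTemplate:
    """Hypothetical pre-stored template: a task name plus its necessary slots."""
    name: str
    necessary_slots: Tuple[str, ...]


def missing_slots(template: TaskTemplate, demand: Dict[str, str]) -> List[str]:
    """Return the necessary slots that the user's demand has not filled."""
    return [slot for slot in template.necessary_slots if not demand.get(slot)]


def complete_demand(
    template: TaskTemplate,
    demand: Dict[str, str],
    ask: Callable[[str], str],
) -> Dict[str, str]:
    """Fill each missing necessary slot by calling `ask` (assumed to query the user)."""
    filled = dict(demand)  # copy so the caller's demand is left untouched
    for slot in missing_slots(template, filled):
        filled[slot] = ask(slot)
    return filled


# Hypothetical example: the user said where to go but not when.
template = TaskTemplate("book_taxi", ("destination", "pickup_time"))
demand = {"destination": "airport"}

print(missing_slots(template, demand))
completed = complete_demand(template, demand, ask=lambda slot: "08:00")
print(completed)
```

In a real voice system, `ask` would presumably issue a follow-up prompt and parse the spoken reply; a fixed lambda stands in for that step here.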