Patents Examined by Abul K. Azad
  • Patent number: 12293775
    Abstract: A voice control method and apparatus, a chip, earphones, and a system. The method includes: recognizing (001) whether a voice signal includes a keyword; in response to the voice signal including the keyword, executing (001a) an instruction corresponding to the keyword or sending the instruction; before recognizing whether the voice signal includes the keyword, determining (002) whether the voice signal is from a target user and, in response to the voice signal being from the target user, starting to recognize (001) whether the voice signal includes the keyword; or during recognizing whether the voice signal includes the keyword, determining (002) whether the voice signal is from the target user and, in response to the voice signal being from a non-target user, stopping recognizing (003a) whether the voice signal includes the keyword. The voice control method reduces the power consumption of voice control and improves the endurance.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: May 6, 2025
    Assignee: SHENZHEN GOODIX TECHNOLOGY CO., LTD.
    Inventors: Zhiyao Liu, Shuqing Cheng
  • Patent number: 12282516
    Abstract: A method includes extracting a set of candidate keywords from clickstream data and natural language processing of product text for a plurality of search queries. The set of candidate keywords are filtered based on the clickstream data. The set of candidate keywords as filtered are ranked based on the clickstream data. The set of candidate keywords as ranked are clustered to remove near duplicates. The set of candidate keywords as ranked for a respective search query is output.
    Type: Grant
    Filed: May 6, 2022
    Date of Patent: April 22, 2025
    Assignee: Home Depot Product Authority, LLC
    Inventors: Venkata Goutham Simhadri, Janani Balaji, Jeyaprakash Singarayar, Olga Stolpovskaia, Suhail Shaikh
  • Patent number: 12277155
    Abstract: An online system extracts information from a user for use in workflows using a machine learning-based language model. The online system creates a weighted epoch tree comprising epoch nodes, each epoch node associated with a time interval associated with the user. An epoch node has a relevance score determined based on a set of events associated with the user that occurred during a time interval. The online system builds the weighted epoch tree by selecting an epoch node for further exploration based on relevance scores and determining a question relevant to a context represented by the selected epoch node. The online system determines an answer to the question and either adds the answer to an existing node or to new epoch nodes added to the weighted epoch tree. The online system may use the weighted epoch tree for generating a synthetic statement for the user.
    Type: Grant
    Filed: November 22, 2024
    Date of Patent: April 15, 2025
    Inventor: Yashraj Panwar
  • Patent number: 12266373
    Abstract: A method and apparatus for audio processing, an electronic device and a storage medium are provided. The method includes: obtaining an audio encoding result, wherein each element in the audio encoding result has a coordinate in an audio frame number dimension and a coordinate in a text label sequence dimension; in response to an output result of an ith frame in a decoding path being a non-null character, respectively increasing the coordinate in the audio frame number dimension and the coordinate in the text label sequence dimension corresponding to an output position of the ith frame by 1 to obtain an output position of a (i+1)th frame in the decoding path; and determining an output result corresponding to the output position of the (i+1)th frame according to the output result of the ith frame in the decoding path and an element of the (i+1)th frame in the audio encoding result.
    Type: Grant
    Filed: December 9, 2022
    Date of Patent: April 1, 2025
    Assignee: BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.
    Inventors: Mingshuang Luo, Fangjun Kuang, Liyong Guo, Long Lin, Wei Kang, Zengwei Yao, Povey Daniel
  • Patent number: 12260233
    Abstract: Methods and systems are described herein for addressing issues associated with varying graph analytics tools that require different tool-specific coding languages. An artificial intelligence (AI) sub-system of various modules extracts metadata from a dataset and identifies nodes and relationships in the dataset using the metadata. The dataset is matched with a corresponding graph-analytics template in a data store, and a dynamic template modifier modifies the corresponding graph-analytics template. In some examples, the AI system generates smart guided videos with logical breakpoints that are embedded along with templates for quick learning and to build faster graphical analytics. The AI system includes a dynamic template modifier and a cognitive smart AI engine that includes a graph.
    Type: Grant
    Filed: November 2, 2022
    Date of Patent: March 25, 2025
    Assignee: Bank of America Corporation
    Inventors: Siva Paini, Sakshi Bakshi
  • Patent number: 12249170
    Abstract: The present embodiments relate to a language identification system for predicting a language and text content of text lines in an image-based document. The language identification system uses a trainable neural network model that integrates multiple neural network models in a single unified end-to-end trainable architecture. A CNN and an RNN of the model can process text lines and derive visual and contextual features of the text lines. The derived features can be used to predict a language and text content for the text line. The CNN and the RNN can be jointly trained by determining losses based on the predicted language and content and corresponding language labels and text labels for each text line.
    Type: Grant
    Filed: August 26, 2022
    Date of Patent: March 11, 2025
    Assignee: Oracle International Corporation
    Inventors: Liyu Gong, Yuying Wang, Zhonghai Deng, Iman Zadeh, Jun Qian
  • Patent number: 12243019
    Abstract: This disclosure relates to the field of project related document analysis. Conventionally, process of retrieving right information related to a stakeholder with relevant project initiation concerns involves manual intervention resulting in more time consumption. The method of the present disclosure describes a system and method for automated extraction and classification of project initiation related information from request for proposal response documents. The RFP response document is parsed using document structure based parsing technique to identify questions and answers. The questions from the RFP response document are classified into different classes of interest and important information from the answers are extracted and mapped to an identified class. The method of the present disclosure demonstrates significant improvement in terms of time consumption by reducing volume of information and providing a quick access to class-specific relevant information from the RFP response document.
    Type: Grant
    Filed: November 29, 2022
    Date of Patent: March 4, 2025
    Assignee: TATA CONSULTANCY SERVICES LIMITED
    Inventors: Asha Sushilkumar Rajbhoj, Padmalata Venkata Nistala, Vinay Kulkarni
  • Patent number: 12243522
    Abstract: Methods and devices are provided in which one or more identifiers are registered based on a user setting made by a user of an electronic device. Each of the one or more identifiers corresponds to at least one activated service module. A registered identifier, of the one or more identifiers, is extracted from a user command input through an input device. The user command is changed using a basic identifier preset for a first service module corresponding to the registered identifier. The changed user command is transmitted to a server configured to control execution of the first service module. A result of executing the changed user command based on the first service module is received from the server.
    Type: Grant
    Filed: August 8, 2022
    Date of Patent: March 4, 2025
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Duseok Kim, Suneung Park
  • Patent number: 12236194
    Abstract: A method of this disclosure may include performing a named entity recognition on text information related to requirements for a wireframe by a first artificial intelligence (AI) model, so as to extract entities and relations of the entities from the text information. The method may further comprise inputting the extracted entities and relations to a second AI model to generate the wireframe, wherein the second AI model is trained so that a difference between resultant relations of the entities of the generated wireframe and the extracted relations of the entities from the first AI model is decreased.
    Type: Grant
    Filed: October 20, 2022
    Date of Patent: February 25, 2025
    Assignee: International Business Machines Corporation
    Inventors: Zhaoqi Wu, Yi Fang Chen, Zhi Wang, Yi Qun Zhang, Yan Du, Li Na Yuan
  • Patent number: 12230257
    Abstract: Various implementations relate to techniques, for controlling smart devices, that are low latency and/or that provide computational efficiencies (client and/or server) and/or network efficiencies. Those implementations relate to generating and/or utilizing cache entries, of a cache that is stored locally at an assistant client device, in control of various smart devices (e.g., smart lights, smart thermostats, smart plugs, smart appliances, smart routers, etc.). Each of the cache entries includes a mapping of text to one or more corresponding semantic representations.
    Type: Grant
    Filed: September 15, 2023
    Date of Patent: February 18, 2025
    Assignee: GOOGLE LLC
    Inventors: David Roy Schairer, Di Lin, Lucas Palmer
  • Patent number: 12212612
    Abstract: In one aspect, an apparatus may include at least one processor and storage accessible to the at least one processor. The storage may include instructions executable by the at least one processor to receive a transcription of audio from a first client device. The audio may be detected at the first client device and may be streamed from the first client device as part of a video conference. The instructions may also be executable to determine that a second client device is not presenting a first part of the audio. Based on the determination, the instructions may be executable to send a first part of the transcription to the second client device and/or to present the first part of the transcription at the second client device.
    Type: Grant
    Filed: March 31, 2022
    Date of Patent: January 28, 2025
    Assignee: Lenovo (Singapore) Pte. Ltd.
    Inventors: Joshua Smith, Inna Zolin, Carl H. Seaver, Matthew Fardig
  • Patent number: 12211498
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, a natural-language user input is received at an electronic device and a user intent determined. Where an audio recording corresponding to the intent is available, a digital assistant of the electronic device provides a first spoken output introducing the audio recording, the audio recording itself, and a second spoken output indicating the end of the audio recording.
    Type: Grant
    Filed: February 16, 2022
    Date of Patent: January 28, 2025
    Assignee: Apple Inc.
    Inventors: Michael Scott Hergenrader, Ryann Stevenson Brewer, Hiu Yi Chan, Eunice Y. Lee, Shahbaaz Tajuddin Mhaisale
  • Patent number: 12206820
    Abstract: Systems and methods for processing calls to determine if the call is potentially fraudulent or unwanted. The system extracts a speech signal from an audio signal associated with a call. The system identifies audio characteristics based on analysis of the audio signal. The system generates a textual transcript of the audio signal based on automatic speech recognition of the speech signal, which is used to assign text categories for the call based on an automated multi-label textual classification of the textual transcript. The system assigns audio categories for the call based on automated multi-label acoustic classification of the speech signal. The system generates an output label for the call based on a combined analysis of the text categories, the audio categories, and the audio characteristics. The language spoken during the call may be detected and used to generate the textual transcript and to assign the text categories and the audio categories.
    Type: Grant
    Filed: July 29, 2022
    Date of Patent: January 21, 2025
    Assignee: REALNETWORKS LLC
    Inventors: Branimir Dropuljic, Michael J. Bordash
  • Patent number: 12205613
    Abstract: A communication system and a method can be configured to facilitate the performance of a conference. The system can include a conference organizer terminal and at least two participants' terminals each assigned to respective conference participants who each log in to start a conference on the communication system. The communication system can be configured to calculate a decision situation at a particular point in time of the ongoing conference by analyzing the views expressed by the conference participants during the conference and send data relating to the decision situation for that point in time to the conference organizer's terminal and/or other conference participant terminals for use in facilitating the conference. In some embodiments, such data can be used to assist the conference participants in recognizing when there is a consensus made on at least one decision to be made during the conference.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: January 21, 2025
    Assignee: RINGCENTRAL, INC.
    Inventors: Jurgen Totzke, Karl Klug
  • Patent number: 12197817
    Abstract: This relates to systems and processes for using a virtual assistant to arbitrate among and/or control electronic devices. In one example process, a first electronic device samples an audio input using a microphone. The first electronic device broadcasts a first set of one or more values based on the sampled audio input. Furthermore, the first electronic device receives a second set of one or more values, which are based on the audio input, from a second electronic device. Based on the first set of one or more values and the second set of one or more values, the first electronic device determines whether to respond to the audio input or forego responding to the audio input.
    Type: Grant
    Filed: August 18, 2023
    Date of Patent: January 14, 2025
    Assignee: Apple Inc.
    Inventors: Kurt W. Piersol, Ryan M. Orr, Daniel J. Mandel
  • Patent number: 12198680
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate multi-task training a recurrent neural network transducer (RNN-T) using automatic speech recognition (ASR) information are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can include an RNN-T that can receive ASR information. The computer executable components can include a voice activity detection (VAD) model that trains the RNN-T using the ASR information, where the RNN-T can further comprise an encoder and a joint network. One or more outputs of the encoder can be integrated with the joint network and one or more outputs of the VAD model.
    Type: Grant
    Filed: July 28, 2022
    Date of Patent: January 14, 2025
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sashi Novitasari, Takashi Fukuda, Gakuto Kurata
  • Patent number: 12198693
    Abstract: The present invention discloses an information processing method, an information control center device, and a computer-readable storage medium. The method comprises: obtaining semantic parsing information corresponding to a sound signal, the semantic parsing information including a designated time; performing a time prediction on the designated time based on a current time to determine an intended time; and generating a target instruction corresponding to the sound signal based on the intended time. With this method, the information control center device can process complex and diverse sound signals, and the designated time and target intention in a sound signal can be extracted from the semantic parsing information. Because of the time prediction on the semantic parsing information with the designated time, the designated time provided in the sound signal can be processed more accurately, and the voice interaction process is more accurate.
    Type: Grant
    Filed: November 9, 2020
    Date of Patent: January 14, 2025
    Assignee: AI Speech Co., Ltd.
    Inventors: Yongkai Lin, Shuai Fan, Peng Yang, Ruiting Xu
  • Patent number: 12190897
    Abstract: An apparatus for processing an audio signal having associated therewith a pitch lag information and a gain information, includes a domain converter for converting a first domain representation of the audio signal into a second domain representation of the audio signal; and a harmonic post-filter for filtering the second domain representation of the audio signal, wherein the post-filter is based on a transfer function including a numerator and a denominator, wherein the numerator includes a gain value indicated by the gain information, and wherein the denominator includes an integer part of a pitch lag indicated by the pitch lag information and a multi-tap filter depending on a fractional part of the pitch lag.
    Type: Grant
    Filed: May 16, 2023
    Date of Patent: January 7, 2025
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Christian Helmrich, Goran Markovic, Matthias Neusinger, Sascha Disch, Manuel Jander, Martin Dietz
  • Patent number: 12190885
    Abstract: Configurable core domains of a speech processing system are described. A core domain output data format for a given command is originally configured with default content portions. When a user indicates additional content should be output for the command, the speech processing system creates a new output data format for the core domain. The new output data format is user specific and includes both default content portions as well as user preferred content portions.
    Type: Grant
    Filed: September 18, 2023
    Date of Patent: January 7, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: Rohan Mutagi, Felix Wu, Rongzhou Shen, Neelam Satish Agrawal, Vibhunandan Gavini, Pablo Carballude Gonzalez
  • Patent number: 12182497
    Abstract: An expanded utterance that is used to output a more appropriate output utterance for an utterance can be generated. An utterance sentence expansion device includes an expansion unit that inserts, for an utterance that is an utterance to be expanded that includes a noun and is morphologically analyzed in advance, by using information of an expansion dictionary, which includes higher-level categories of the noun, one or more higher-level categories of the expansion dictionary corresponding to the noun included in the utterance into a position before the noun of the utterance to generate an expanded utterance.
    Type: Grant
    Filed: April 10, 2020
    Date of Patent: December 31, 2024
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Ko Mitsuda, Ryuichiro Higashinaka, Junji Tomita
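
Illustrative sketch for patent 12182497 (utterance expansion): the abstract describes inserting higher-level categories of a noun, taken from an expansion dictionary, immediately before that noun in a morphologically analyzed utterance. The Python below is a minimal sketch of that idea only, not the patented implementation. The function name expand_utterance, the (surface, part-of-speech) token format, the "NOUN" tag, and the sample dictionary are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Assumed token format: (surface form, part-of-speech tag), as would be
# produced by a morphological analyzer run in advance.
Token = Tuple[str, str]


def expand_utterance(
    tokens: List[Token],
    expansion_dict: Dict[str, List[str]],
) -> List[str]:
    """Insert each noun's higher-level categories just before the noun."""
    expanded: List[str] = []
    for surface, pos in tokens:
        if pos == "NOUN":
            # Nouns missing from the dictionary are left unexpanded.
            expanded.extend(expansion_dict.get(surface, []))
        expanded.append(surface)
    return expanded


# Hypothetical example data, not taken from the patent.
tokens = [("I", "PRON"), ("like", "VERB"), ("ramen", "NOUN")]
expansion_dict = {"ramen": ["food", "noodles"]}

print(expand_utterance(tokens, expansion_dict))
# prints ['I', 'like', 'food', 'noodles', 'ramen']
```

Keeping the original noun after the inserted categories mirrors the abstract's wording ("into a position before the noun"); how many category levels to insert, and in what order, is left open here.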