Patents Examined by Qi Han
-
Patent number: 12205577
Abstract: Techniques for rendering visual content, in response to one or more utterances, are described. A device receives one or more utterances that define a parameter(s) for desired output content. A system (or the device) identifies natural language data corresponding to the desired content, and uses natural language generation processes to update the natural language data based on the parameter(s). The system (or the device) then generates an image based on the updated natural language data. The system (or the device) also generates video data of an avatar. The device displays the image and the avatar, and synchronizes movements of the avatar with output of synthesized speech of the updated natural language data. The device may also display subtitles of the updated natural language data, and cause a word of the subtitles to be emphasized when synthesized speech of the word is being output.
Type: Grant
Filed: March 30, 2021
Date of Patent: January 21, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Taehwan Kim, Sanqiang Zhao, Robinson Piramuthu, Seokhwan Kim, Yang Liu, Gokhan Tur, Eshan Bhatnagar
-
Patent number: 12198717
Abstract: Examples are disclosed to perform signature matching using noise cancellation models to achieve consensus. Example apparatus disclosed herein include a signature matcher to compare a first stream of monitored media signatures to streams of reference signatures representative of corresponding reference media to determine a first signature match, and compare a second stream of monitored media signatures to the streams of reference signatures to determine a second signature match; a match selector to use at least one of the first signature match or the second signature match to identify a first one of the reference media corresponding to the monitored media data; and a creditor interface to output identification data for the first one of the reference media identified with the at least one of the first signature match or the second signature match, the identification data to be used to credit a media exposure corresponding to the monitored media.
Type: Grant
Filed: November 14, 2022
Date of Patent: January 14, 2025
Assignee: The Nielsen Company (US), LLC
Inventors: Jeremey M. Davis, Christen V. Nielsen, Kevin Keqiang Deng, Alexander Topchy
-
Patent number: 12197879
Abstract: An electronic device obtains a to-be-translated sentence. The electronic device divides the to-be-translated sentence into a preset quantity of clauses. The electronic device separately translates each of the clauses to obtain a respective translation result corresponding to each of the clauses. The electronic device combines the respective translation results corresponding to each of the clauses according to semantics to obtain a target translation sentence corresponding to the to-be-translated sentence.
Type: Grant
Filed: March 31, 2022
Date of Patent: January 14, 2025
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Qiu Ran, Yankai Lin, Peng Li, Jie Zhou
-
Patent number: 12189684
Abstract: Methods and apparatus for audio watermarking and watermark detection and extraction are described herein. An example method includes receiving a media content signal, sampling the media content signal to generate samples, storing the samples in a buffer, determining a first sequence of samples in the buffer, determining a second sequence of samples in the buffer, wherein the second sequence of samples is of substantially equal length to the first sequence of samples, calculating an average of the first sequence of samples and the second sequence of samples to generate an average sequence of samples, extracting an identifier from the average sequence of samples, and storing the identifier in a tangible memory.
Type: Grant
Filed: October 11, 2023
Date of Patent: January 7, 2025
Assignee: The Nielsen Company (US), LLC
Inventors: Venugopal Srinivasan, Alexander Topchy
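
The averaging step named in this abstract can be illustrated with a minimal sketch. It assumes the samples are plain floats held in a list and that the two sequences begin at the hypothetical offsets `start_a` and `start_b`; none of these names come from the patent, and the identifier extraction that follows the averaging is not specified in the abstract, so it is left out.

```python
from typing import List, Sequence


def average_sequences(
    buffer: Sequence[float], start_a: int, start_b: int, length: int
) -> List[float]:
    """Average two equal-length runs of samples taken from one buffer.

    start_a, start_b and length are hypothetical parameters; the abstract
    only says the two sequences have substantially equal length.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    for start in (start_a, start_b):
        if start < 0 or start + length > len(buffer):
            raise ValueError("sequence falls outside the buffer")
    first = buffer[start_a:start_a + length]
    second = buffer[start_b:start_b + length]
    # Element-wise mean of the two runs.
    return [(a + b) / 2.0 for a, b in zip(first, second)]


buffer = [1.0, 2.0, 3.0, 4.0, 3.0, 2.0, 1.0, 0.0]
averaged = average_sequences(buffer, start_a=0, start_b=4, length=4)
print(averaged)
```

An identifier extractor, whatever form it takes, would then read from `averaged` rather than from the raw buffer.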
-
Patent number: 12170098
Abstract: The present disclosure discloses a sound detection method. The method includes: obtaining an initial sound signal and a spatial distribution spectrum of the initial sound signal; segmenting the initial sound signal, to obtain a target sound segment, and obtaining a timestamp corresponding to the target sound segment, the target sound segment including a speech of at least one object, and the timestamp being used for indicating a start time of the target sound segment and an end time of the target sound segment; segmenting the spatial distribution spectrum by using the timestamp, to obtain a spatial distribution spectrum segment corresponding to the target sound segment; and inputting the target sound segment and the spatial distribution spectrum segment into a sound detection model, to obtain a first sound detection result, the first sound detection result being used for describing whether sound of multiple objects exists in the initial sound signal.
Type: Grant
Filed: August 26, 2022
Date of Patent: December 17, 2024
Assignee: Alibaba Damo (Hangzhou) Technology Co., Ltd.
Inventors: Shiliang Zhang, Siqi Zheng, Weilong Huang
-
Patent number: 12165672
Abstract: A nonverbal information generation apparatus includes a display unit that partitions text into predetermined units, displays the text partitioned into the predetermined units, and makes nonverbal information that represents information about behavior of a verbal output agent or nonverbal information that represents information about behavior of a receiver of verbal information of the verbal output agent that corresponds to the text when the verbal output agent outputs the verbal information visible in association with the predetermined units of the text.
Type: Grant
Filed: February 15, 2019
Date of Patent: December 10, 2024
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Ryo Ishii, Ryuichiro Higashinaka, Taichi Katayama, Junji Tomita, Nozomi Kobayashi, Kyosuke Nishida
-
Patent number: 12159634
Abstract: An improved gain-shape vector quantization is achieved by determining a number of bits to be allocated to a gain adjustment- and shape-quantizer for a plurality of combinations of a current bit rate and a first signal property. The bit allocation is derived by using an average of optimal bit allocations for a training data set. A number of bits to be allocated to the gain adjustment- and the shape-quantizers for a plurality of combinations of the bit rate and a first signal property is pre-calculated, and a table indicating the number of bits to be allocated to the gain adjustment- and the shape-quantizers for a plurality of combinations of the bit rate and a first signal property is created. In this way, the table can be used for achieving an improved bit allocation.
Type: Grant
Filed: August 3, 2020
Date of Patent: December 3, 2024
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventor: Erik Norvell
-
Patent number: 12153883
Abstract: Disclosed herein are system, method, and computer program product embodiments for a text-to-speech system. An embodiment operates by identifying a portion of content of a document that is to be replaced with a summary associated with the content responsive to a request for an audible version of the document. A first request for the audible version of the document is received. A first summary of the content at a first level of detail is generated. The first summary is audibly output. A second request for additional information is received. A second summary of the content at a different, second level of detail is generated. The second summary of the content at the second level of detail is audibly output.
Type: Grant
Filed: July 12, 2023
Date of Patent: November 26, 2024
Assignee: Capital One Services, LLC
Inventors: Galen Rafferty, Reza Farivar, Anh Truong, Jeremy Goodsitt, Vincent Pham, Austin Walters
-
Patent number: 12147770
Abstract: An information processing device includes an analyzing unit configured to analyze text data representing claims included in patent document data to identify constituent components of an invention for each claim included in the claims, and a display control unit configured to cause a display device to display texts indicating each claim in the claims, in a form in which the texts indicating each claim are partitioned into the constituent components.
Type: Grant
Filed: December 17, 2020
Date of Patent: November 19, 2024
Assignee: RESONAC CORPORATION
Inventors: Chinatsu Tanabe, Hiroko Takashi, Nao Takeuchi, Eriko Takeda, Kenichiro Nakajima, Tomoko Miyashita
-
Patent number: 12148425
Abstract: An electronic device is provided that may identify voiceprint data corresponding to a user from among at least one piece of voiceprint data, based on received user voice data, identify a general voice instruction included in the received user voice data, determine user preference information of the user, based on the identified voiceprint data, determine a control action for determining an action to be performed in at least one external device or the electronic device, based on the general voice instruction identified from the received user voice data, determine a personalized voice instruction, based on at least one of the control action or the user preference information, and transmit, to the at least one external device, through the communication circuit, an audio signal corresponding to the personalized voice instruction to be output by the at least one external device, or output the audio signal through a speaker included in the electronic device.
Type: Grant
Filed: April 12, 2022
Date of Patent: November 19, 2024
Assignee: Samsung Electronics Co., Ltd
Inventors: Yeseul Hong, Dayoung Lee, Boram Lee
-
Patent number: 12142276
Abstract: Described herein is a system for automatically detecting and assigning action items in a real-time conversation and determining whether such action items have been completed. The system detects, during a meeting, a plurality of action items and an utterance that corresponds to a completed action item. Responsive to detecting the utterance, the system generates a similarity score with respect to a first action item of the plurality of action items. The system compares the similarity score to a first threshold. Responsive to determining that the similarity score does not exceed the first threshold, the system generates a second similarity score with respect to a second action item of the plurality of action items. The system compares the second similarity score to a second threshold, which exceeds the first threshold. Responsive to determining that the second similarity score exceeds the second threshold, the system marks the second action item as completed.
Type: Grant
Filed: August 12, 2023
Date of Patent: November 12, 2024
Assignee: Outreach Corporation
Inventors: Rohit Ganpat Mane, Abhishek Abhishek, Krishnamohan Reddy Nareddy, Rajiv Garg
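
The two-threshold comparison described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract does not say how similarity is scored, so a simple token-overlap (Jaccard) score stands in for it, and the function names, threshold values, the choice to mark the first item when its score exceeds the first threshold, and the loop over any further items are all hypothetical.

```python
from typing import Optional, Sequence


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity in [0, 1]; a stand-in for the unspecified scorer."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def match_completed_item(
    utterance: str,
    action_items: Sequence[str],
    first_threshold: float = 0.5,
    second_threshold: float = 0.6,
) -> Optional[int]:
    """Return the index of the action item to mark completed, or None."""
    if second_threshold <= first_threshold:
        raise ValueError("second threshold must exceed the first")
    if not action_items:
        return None
    # First action item is checked against the lower threshold (assumed outcome:
    # it is marked completed when the score exceeds that threshold).
    if jaccard(utterance, action_items[0]) > first_threshold:
        return 0
    # Remaining items must clear the stricter, higher threshold.
    for index in range(1, len(action_items)):
        if jaccard(utterance, action_items[index]) > second_threshold:
            return index
    return None


items = ["email the pricing deck", "book a follow-up demo"]
completed = match_completed_item("I did book a follow-up demo", items)
print(completed)
```

Requiring a higher score for the fallback item reflects the abstract's statement that the second threshold exceeds the first.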
-
Patent number: 12136422
Abstract: Method, system and product for automatic execution of operations sequences. An operations sequence, which includes a first operation and a second operation, is obtained. The operations sequence comprises at least one user interaction that includes clicking on a clickable element. The operations sequence or portion thereof is automatically executed by mimicking user interactions with the GUI, as indicated in the operations sequence, including by automatically clicking on the clickable element.
Type: Grant
Filed: October 9, 2022
Date of Patent: November 5, 2024
Assignee: WALKME LTD.
Inventors: Ron Zohar, Moran Shemer
-
Patent number: 12125489
Abstract: Techniques for using multiple voice-enabled devices in a user environment to reduce the latency for obtaining responses to user utterances from a remote system. The voice-enabled devices may each establish connections with the remote system to have the remote system perform supplemental speech processing for utterances the devices are unable to process locally. One voice-enabled device may have a higher-latency connection to the remote system, and another voice-enabled device may have a lower-latency connection to the remote system. The lower-latency device may send an utterance to the remote system before the higher-latency device is able, and the remote system may begin processing the utterance faster than if the higher-latency device sent the utterance. The remote system may then provide a response for the utterance to the higher-latency device in less time than if the remote system had to wait for the utterance from the higher-latency device.
Type: Grant
Filed: July 7, 2023
Date of Patent: October 22, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Sahil Puri, Zhengran Li, Zhan Xu, Oliver Sinsik Chiu, Sembhayya Gollakota, Bruno Dufour
-
Patent number: 12125471
Abstract: Systems and methods for correcting recognition errors in speech recognition systems are disclosed herein. Natural conversational variations are identified to determine whether a query intends to correct a speech recognition error or whether the query is a new command. When the query intends to correct a speech recognition error, the system identifies a location of the error and performs the correction. The corrected query can be presented to the user or be acted upon as a command for the system.
Type: Grant
Filed: June 20, 2023
Date of Patent: October 22, 2024
Assignee: Rovi Guides, Inc.
Inventors: Ankur Anil Aher, Jeffry Copps Robert Jose
-
Patent number: 12118998
Abstract: Implementations are set forth herein for creating an order of execution for actions that were requested by a user, via a spoken utterance to an automated assistant. The order of execution for the requested actions can be based on how each requested action can, or is predicted to, affect other requested actions. In some implementations, an order of execution for a series of actions can be determined based on an output of a machine learning model, such as a model that has been trained according to supervised learning. A particular order of execution can be selected to mitigate waste of processing, memory, and network resources, at least relative to other possible orders of execution. Using interaction data that characterizes past performances of automated assistants, certain orders of execution can be adapted over time, thereby allowing the automated assistant to learn from past interactions with one or more users.
Type: Grant
Filed: August 7, 2023
Date of Patent: October 15, 2024
Assignee: GOOGLE LLC
Inventors: Mugurel Ionut Andreica, Vladimir Vuskovic, Joseph Lange, Sharon Stovezky, Marcin Nowak-Przygodzki
-
Patent number: 12100310
Abstract: A method includes obtaining a speech proficiency value indicator indicative of a speech proficiency value associated with a user of the electronic device. The method further includes, in response to determining that the speech proficiency value satisfies a threshold proficiency value: displaying training text via the display device; obtaining, from the audio sensor, speech data associated with the training text, wherein the speech data is characterized by the speech proficiency value; determining, using a speech classifier, one or more speech characterization vectors for the speech data based on linguistic features within the speech data; and adjusting one or more operational values of the speech classifier based on the one or more speech characterization vectors and the speech proficiency value.
Type: Grant
Filed: December 8, 2023
Date of Patent: September 24, 2024
Assignee: APPLE INC.
Inventors: Barry-John Theobald, Russell Y. Webb, Nicholas Elia Apostoloff
-
Patent number: 12094487
Abstract: An audio system for spatializing virtual sound sources is described. A microphone array of the audio system is configured to monitor sound in a local area. A controller of the audio system identifies sound sources within the local area using the monitored sound from the microphone array and determines their locations. The controller of the audio system generates a target position for a virtual sound source based on one or more constraints. The one or more constraints include that the target position be at least a threshold distance away from each of the determined locations of the identified sound sources. The controller generates one or more sound filters based in part on the target position to spatialize the virtual sound source. A transducer array of the audio system presents spatialized audio including the virtual sound source content based in part on the one or more sound filters.
Type: Grant
Filed: September 21, 2021
Date of Patent: September 17, 2024
Assignee: META PLATFORMS TECHNOLOGIES, LLC
Inventors: Pablo Francisco Faundez Hoffmann, Peter Harty Dodds
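
The distance constraint in this abstract can be illustrated with a small sketch. It assumes positions are 3-D Cartesian coordinates and that a list of candidate positions is already available; the function name, the candidate list, and the first-match selection rule are hypothetical, since the abstract states only that the target must be at least a threshold distance from every identified source.

```python
import math
from typing import Iterable, Optional, Sequence, Tuple

Point = Tuple[float, float, float]


def pick_target_position(
    candidates: Iterable[Point],
    source_locations: Sequence[Point],
    min_distance: float,
) -> Optional[Point]:
    """Return the first candidate at least min_distance from every source."""
    for candidate in candidates:
        # Euclidean distance to each identified sound source must meet the threshold.
        if all(
            math.dist(candidate, source) >= min_distance
            for source in source_locations
        ):
            return candidate
    return None  # no candidate satisfies the constraint


sources = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
candidates = [(1.0, 0.0, 0.0), (1.0, 2.0, 0.0)]
target = pick_target_position(candidates, sources, min_distance=1.5)
print(target)
```

The first candidate is rejected because it lies only 1.0 from each source; the second is about 2.24 from both and is accepted.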
-
Patent number: 12093651
Abstract: There is a need for more accurate and more efficient natural language solutions with greater semantic intelligence. This need can be addressed, for example, by natural language processing techniques that utilize predictive entity scoring. In one example, a method includes determining an overall prevalence score for the input entity data object with respect to a scored document corpus and a target section; determining a qualified prevalence score for the input entity data object with respect to a high-scoring subset of the scored document corpus; processing the input entity data object using an entity scoring machine learning model to generate the predicted entity score, wherein the entity scoring machine learning model may be characterized by a plurality of multiplicative hyper-parameters and one or more additive hyper-parameters; and performing one or more prediction-based actions based at least in part on the predicted entity score.
Type: Grant
Filed: February 9, 2022
Date of Patent: September 17, 2024
Assignee: Optum, Inc.
Inventors: Nathan H. Funk, Eric D. Tryon, Amy L. Jensen, Sudheer Ponnala, M. P. S. Jagannadha Rao, Raghav Bali, Veera Raghavendra Chikka, Subhadip Maji, Anudeep Srivatsav Appe
-
Patent number: 12087286
Abstract: A computing system obtains features that have been extracted from an acoustic signal, where the acoustic signal comprises spoken words uttered by a user. The computing system performs automatic speech recognition (ASR) based upon the features and a language model (LM) generated based upon expanded pattern data. The expanded pattern data includes a name of an entity and a search term, where the entity belongs to a segment identified in a knowledge base. The search term has been included in queries for entities belonging to the segment. The computing system identifies a sequence of words corresponding to the features based upon results of the ASR. The computing system transmits computer-readable text to a search engine, where the text includes the sequence of words.
Type: Grant
Filed: May 6, 2021
Date of Patent: September 10, 2024
Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventors: Ankur Gupta, Satarupa Guha, Rupeshkumar Rasiklal Mehta, Issac John Alphonso, Anastasios Anastasakos, Shuangyu Chang
-
Patent number: 12087293
Abstract: The invention discloses a smart voice wake-up control method and a control device thereof. The smart voice wake-up control device includes a casing, a main control board, a control switch, a voice pickup, a relay, an output terminal, an input terminal and a work indicator capable of emitting multiple colors of light, the main control board is respectively connected with the control switch, the voice pickup, the relay and the work indicator, and the relay is respectively connected with the output terminal and the input terminal. When multiple smart voice wake-up control devices are used simultaneously, the user can change the light color of the work indicator by controlling the control switch to distinguish different colors of voice wake-up commands, i.e., use the light color of the work indicator to show the color wake-up voice, so that the user can get the wake-up commands intuitively and quickly for controlling.
Type: Grant
Filed: January 10, 2022
Date of Patent: September 10, 2024
Assignees: Dongguan Well Shin Electronic Products Co., Ltd., WELL SHIN TECHNOLOGY CO., LTD.
Inventors: Jui Hsiung Wu, Chun Xi Ju