Modification Of At Least One Characteristic Of Speech Waves (epo) Patents (Class 704/E21.001)
-
Patent number: 11979360
Abstract: The present disclosure provides method and apparatus for responding in a voice conversation by an electronic conversational agent. A voice input may be received in an audio upstream. In response to the voice input, a primary response and at least one supplementary response may be generated. A primary voice output may be generated based on the primary response. At least one supplementary voice output may be generated based on the at least one supplementary response. The primary voice output and the at least one supplementary voice output may be provided in an audio downstream, wherein the at least one supplementary voice output is provided during a time period adjacent to the primary voice output in the audio downstream.
Type: Grant
Filed: October 25, 2018
Date of Patent: May 7, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventor: Li Zhou
-
Patent number: 11971513
Abstract: Methods of and systems for forming an image of a subterranean region of interest are disclosed. The method includes obtaining an observed seismic dataset and a seismic velocity model for the subterranean region of interest and generating a simulated seismic dataset based on the seismic velocity model and the source and receiver geometry of the observed seismic dataset. The method also includes forming a plurality of time-windowed trace pairs from the simulated and the observed seismic datasets, and forming an objective function based on a penalty function and a cross-correlation between the members of each pair. The method further includes determining a seismic velocity increment based on the extremum of the objective function and forming an updated seismic velocity model by combining the seismic velocity increment and the seismic velocity model, and forming the image of the subterranean region of interest based on the updated seismic velocity model.
Type: Grant
Filed: May 21, 2021
Date of Patent: April 30, 2024
Assignee: SAUDI ARABIAN OIL COMPANY
Inventors: Weiguang He, Yubing Li, Lu Liu, Yi Luo
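As a rough illustration of the objective function this abstract describes, the sketch below pairs time windows from a simulated and an observed trace, cross-correlates each pair, and weights the squared correlation by a penalty on the lag. The function names, the non-overlapping windows, and the lag-squared penalty are all assumptions made for illustration; the abstract does not specify any of them.

```python
def cross_correlation(a, b, max_lag):
    """Return {lag: sum over t of a[t] * b[t + lag]} for lags in [-max_lag, max_lag]."""
    n = len(a)
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for t in range(n):
            u = t + lag
            if 0 <= u < n:
                total += a[t] * b[u]
        result[lag] = total
    return result


def windows(trace, size):
    """Split a trace into consecutive, non-overlapping time windows (assumed layout)."""
    return [trace[i:i + size] for i in range(0, len(trace) - size + 1, size)]


def objective(simulated, observed, window_size, max_lag,
              penalty=lambda lag: lag * lag):
    """Sum the penalty-weighted squared cross-correlation over all window pairs.

    The lag-squared penalty is a hypothetical choice: it is zero when the
    two windows are aligned and grows as correlation energy moves to larger lags.
    """
    total = 0.0
    pairs = zip(windows(simulated, window_size), windows(observed, window_size))
    for sim_w, obs_w in pairs:
        xc = cross_correlation(sim_w, obs_w, max_lag)
        total += sum(penalty(lag) * value * value for lag, value in xc.items())
    return total
```

With this penalty, identical traces give an objective of zero, and the value increases as the observed event shifts in time relative to the simulated one, which is the kind of signal a velocity update could then try to reduce.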
-
Patent number: 11960648
Abstract: A method for determining a current viewing direction of a user of a pair of data glasses having a virtual retina scan display. The method includes at least the method steps: projecting at least substantially parallel infrared laser beams onto an eye of a user of the data glasses, acquiring two-dimensional images from the infrared laser beams reflected back by the eye of the user, and determining pupil contours in the acquired two-dimensional images. The instantaneous viewing direction of the user of the data glasses is ascertained from a comparison of an instantaneous elliptical shape of the pupil contour with an elliptical shape of a reference pupil contour.
Type: Grant
Filed: March 24, 2023
Date of Patent: April 16, 2024
Assignee: ROBERT BOSCH GMBH
Inventor: Johannes Meyer
-
Patent number: 11960514
Abstract: A method of generating content in association with an information search and retrieval system. It begins by receiving a query from a user. The query is semantically-searched to identify a context. A conversation history between the user and the system is identified. An enriched query is then generated by associating to the query both the context and at least a portion of the conversation history. The enriched query is then evaluated/processed by a generative-AI. In response, information associated with the enriched query is received from the generative-AI. A response to the query is then generated using the information, e.g., by passing the information back to the user, by modifying (e.g., editing or supplementing) the information to generate modified information and passing the modified information back to the user, or by dismissing the information. If sensitive information is identified in the utterance, it is masked prior to generating the enriched query.
Type: Grant
Filed: May 1, 2023
Date of Patent: April 16, 2024
Assignee: Drift.com, Inc.
Inventors: Matt Taylert, Bernard Ngombi Kiyanda, Maria C. Moya, Joseph S. Demple, Matthew Pierce
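The enrichment step in this abstract can be pictured with a small sketch: sensitive spans in the query are masked first, and the masked query is then combined with the identified context and part of the conversation history. Everything below is hypothetical: the regular expressions, the placeholder tokens, the plain-text layout of the enriched query, and the `max_turns` cutoff are illustrative assumptions, not details from the patent.

```python
import re

# Hypothetical patterns for two kinds of sensitive data; a real system
# would likely use a broader detector.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
PHONE = re.compile(r"\b\d{3}[-.\s]?\d{3}[-.\s]?\d{4}\b")


def mask_sensitive(text):
    """Replace e-mail addresses and phone numbers with placeholder tokens."""
    text = EMAIL.sub("[EMAIL]", text)
    return PHONE.sub("[PHONE]", text)


def build_enriched_query(query, context, history, max_turns=3):
    """Combine context, recent history, and the masked query into one string.

    Masking happens before enrichment, matching the order in the abstract.
    Only the last `max_turns` history entries are kept ("at least a portion").
    """
    parts = ["Context: " + context]
    parts += ["History: " + turn for turn in history[-max_turns:]]
    parts.append("Query: " + mask_sensitive(query))
    return "\n".join(parts)
```

The resulting string is what would be handed to the generative model; how the context is found and how the model's answer is post-processed are outside this sketch.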
-
Patent number: 11894009
Abstract: An audio processing method applied to a first terminal is described, and includes: in response to receiving of audio data input by a user at the first terminal, and determination that a voice change function is turned on, determining change parameters; and based on the change parameters, performing change processing on the audio data.
Type: Grant
Filed: January 28, 2022
Date of Patent: February 6, 2024
Assignee: Beijing Xiaomi Mobile Software Co., Ltd.
Inventors: Liujun Zhang, Yuqing Hua, Zhen Yang, Zuojing Li
-
Patent number: 11885632
Abstract: Implementations set forth herein relate to pre-emptively initializing an automated assistant in a vehicle according to certain indications, in order to reduce latency while also seeking to preserve computational resources. In some implementations, data for effectuating one or more features of an automated assistant can be loaded into memory of a computing device based on vehicle interaction data. For example, the vehicle interaction data can characterize instances in which the user, from within their vehicle, invoked the automated assistant within a threshold period of time of an application completing an operation. Based on the vehicle interaction data, subsequent instances of the operation being completed while the user is in the vehicle can cause data to be loaded into memory in order to pre-emptively prepare the automated assistant to be utilized by the user.
Type: Grant
Filed: April 15, 2021
Date of Patent: January 30, 2024
Assignee: GOOGLE LLC
Inventors: Vikram Aggarwal, Steven B. Huang
-
Patent number: 11887497
Abstract: A method includes, while displaying a first set of text content via a display device, determining an engagement value that characterizes a level of user engagement with respect to the first set of text content. The method includes, in accordance with a determination that the engagement value satisfies a threshold, replacing the first set of text content with a second set of text content via the display device. The first set of text content is different from the second set of text content. The method includes in accordance with a determination that the engagement value does not satisfy the threshold, maintaining display of the first set of text content via the display device.
Type: Grant
Filed: May 23, 2022
Date of Patent: January 30, 2024
Assignee: APPLE INC.
Inventors: Barry-John Theobald, Russell Y. Webb, Nicholas Elia Apostoloff
-
Patent number: 11880365
Abstract: Embodiments of the invention are directed to a system, method, or computer program product for multimodal and distributed database system structured for dynamic latency reduction. In this regard, the invention comprises a unified data layer structured to map a plurality of data storage mechanisms to a common abstraction and a query engine structured for heterogenous domain based data extraction without requiring input of schema-based queries. In some embodiments, the invention comprises determining (i) one or more data components and (ii) one or more associated data domains associated with the first domain-based query by parsing the user input based on derived metadata from data dictionaries associated with a unified data layer system component. Moreover, the invention is configured to extract stored data from each of a plurality of databases based on the associated one or more data domains.
Type: Grant
Filed: March 23, 2022
Date of Patent: January 23, 2024
Assignee: BANK OF AMERICA CORPORATION
Inventors: Satish Raghavan, Anirudh Kumar Sharma
-
Patent number: 11875165
Abstract: Methods, apparatus, systems, and computer-readable media are provided for providing context specific schema files that allow an automated assistant to broker human-to-computer dialogs between a user and an application that is separate from the automated assistant. The context specific schema file can provide the automated assistant with sufficient data to be responsive to user queries without necessarily communicating with a remote device, such as a server. Multiple different context specific schema files can be made available to the automated assistant according to a context in which a user is interacting with the automated assistant. In this way, latency otherwise exhibited by the automated assistant can be mitigated by providing the automated assistant with the information needed to respond to a user without continually retrieving the information over a network.
Type: Grant
Filed: October 13, 2022
Date of Patent: January 16, 2024
Assignee: GOOGLE LLC
Inventors: Justin Lewis, Scott Davies
-
Patent number: 11875485
Abstract: An image processing method determines a geometric transform of a suspect image by efficiently evaluating a large number of geometric transform candidates in environments with limited processing resources. Processing resources are conserved by using complementary methods for determining a geometric transform of an embedded signal. One method excels at higher geometric distortion, and specifically, distortion caused by greater tilt angle of a camera. Another method excels at lower geometric distortion, for weaker signals. Together, the methods provide a more reliable detector of an embedded data signal in image across a larger range of distortion while making efficient use of limited processing resources in mobile devices.
Type: Grant
Filed: May 31, 2022
Date of Patent: January 16, 2024
Assignee: Digimarc Corporation
Inventor: Vojtech Holub
-
Patent number: 11875786
Abstract: A Natural Language Command system, which receives Natural Language Commands either over a voice system or over a text system. The commands are associated with the session, and thus can be modified.
Type: Grant
Filed: March 10, 2020
Date of Patent: January 16, 2024
Inventor: Scott C Harris
-
Parallel hypothetical reasoning to power a multi-lingual, multi-turn, multi-domain virtual assistant
Patent number: 11869497
Abstract: A virtual assistant system comprising an interface configured to receive user input and provide a response to the user and a processor configured to run machine executable code. A memory storing non-transitory machine executable code configured to process the user input to generate two or more primary interpretations and one or more secondary interpretations based on one or more of the two or more primary interpretations. The code is also configured to process the primary interpretations and alternative interpretations to generate results which lead to two or more terminal states and then score the two or more terminal states to rank the two or more terminal states such that a top ranked terminal state is the top result, which is presented to the user. A transceiver may communicate over a network to a second device configured to assist the virtual assistant system in generating the top result for the user.
Type: Grant
Filed: March 10, 2021
Date of Patent: January 9, 2024
Assignee: MeetKai, Inc.
Inventor: James Kaplan
-
Patent number: 11843565
Abstract: Techniques that facilitate a dialogue system based on contextual information are provided. In one example, a system includes a contextual information component and a dialogue routing component. The contextual information component determines contextual information associated with a user identity based on a statement related to communication information received by a computing device associated with the user identity. The dialogue routing component generates a path traversal for a dialogue system based on the contextual information to facilitate generation of a response to the statement by the dialogue system.
Type: Grant
Filed: September 19, 2019
Date of Patent: December 12, 2023
Inventors: Sunhwan Lee, Saurabh Mishra
-
Patent number: 11837245
Abstract: A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.
Type: Grant
Filed: November 1, 2022
Date of Patent: December 5, 2023
Assignee: AUDIOSHAKE, INC.
Inventor: Luke Miner
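The split, isolate, and recombine flow in this abstract can be sketched in a few lines. In the sketch below a spectrogram is assumed to be a list of time frames, `split_fragments` cuts it into K contiguous pieces, and `toy_model` is a deliberately trivial stand-in for the deep neural network (it simply routes frequency bin 0 to one source and the remaining bins to another). All names and the fragment layout are hypothetical.

```python
def split_fragments(spectrogram, k):
    """Split a list of time frames into k contiguous fragments of near-equal length."""
    n = len(spectrogram)
    bounds = [i * n // k for i in range(k + 1)]
    return [spectrogram[bounds[i]:bounds[i + 1]] for i in range(k)]


def separate(spectrogram, k, model):
    """Run `model` on each fragment and concatenate the per-source outputs in time.

    `model(fragment)` is assumed to return {source_name: isolated_fragment},
    where each isolated fragment has the same number of frames as the input.
    """
    sources = {}
    for fragment in split_fragments(spectrogram, k):
        for name, isolated in model(fragment).items():
            sources.setdefault(name, []).extend(isolated)
    return sources


def toy_model(fragment):
    """Placeholder for the neural network: bin 0 is 'vocals', the rest is 'backing'."""
    vocals = [[frame[0]] + [0.0] * (len(frame) - 1) for frame in fragment]
    backing = [[0.0] + frame[1:] for frame in fragment]
    return {"vocals": vocals, "backing": backing}
```

Calling `separate(spec, 2, toy_model)` returns one full-length spectrogram per source. Converting audio to a spectrogram and back, and any overlap between fragments, are left out of this sketch.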
-
Patent number: 11837244
Abstract: An analysis filter bank corresponding to multiple sub-bands, which performs frequency-division filtering on an input signal to generate multiple sub-band signals, the analysis filter bank comprising: a sub-band response pre-compensator which performs a linear filtering on the input signal to generate a response pre-compensated signal, multiple sub-filters with different central frequencies, which perform complex-type first-order infinite impulse response filtering respectively on the response pre-compensated signal to generate multiple sub-filter signals, and multiple binomially-combining and rotating devices based on a set of binomial weights, each of which performs a weighted summation on at least two of the sub-filter signals with the set of binomial weights, and rotates a weighted-summation result with a rotating phase according to a corresponding sub-band central frequency to generate one of the sub-band signals, wherein the at least two of the sub-filter signals are generated by at least two of the sub-
Type: Grant
Filed: March 29, 2021
Date of Patent: December 5, 2023
Assignee: Invictumtech Inc.
Inventor: Ming-Luen Liou
-
Patent number: 11823689
Abstract: An apparatus includes a receiver and a decoder. The receiver is configured to receive a bitstream that includes a first frame and a second frame. The first frame includes a first portion of a mid channel and a first quantized stereo parameter. The second frame includes a second portion of the mid channel and a second quantized stereo parameter. The decoder is configured to generate a first portion of a channel based on the first portion of the mid channel and the first quantized stereo parameter. The decoder is configured to, in response to the second frame being unavailable for decoding operations, estimate the second quantized stereo parameter based on stereo parameters of one or more preceding frames and generate a second portion of the channel based on the estimated second quantized stereo parameter. The second portion of the channel corresponds to a decoded version of the second frame.
Type: Grant
Filed: December 20, 2021
Date of Patent: November 21, 2023
Assignee: QUALCOMM Incorporated
Inventors: Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Venkatraman Atti
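The concealment idea in this abstract, estimating a lost frame's stereo parameter from preceding frames, can be illustrated with a small sketch. The estimator below (a rounded average of the last two decoded parameters) is purely an assumption; the abstract only says the estimate is based on one or more preceding frames. Frames are reduced to a single integer parameter, and `None` marks a frame that is unavailable for decoding.

```python
def estimate_stereo_parameter(previous, window=2):
    """Estimate a missing quantized parameter from the most recent decoded ones.

    Hypothetical rule: average the last `window` values and round to an integer.
    """
    if not previous:
        raise ValueError("no preceding frames to estimate from")
    recent = previous[-window:]
    return round(sum(recent) / len(recent))


def decode_parameters(frames):
    """Return the stereo parameter used for each frame.

    `frames` holds one quantized stereo parameter per frame, or None when the
    frame is unavailable; missing values are replaced by an estimate.
    """
    decoded = []
    for parameter in frames:
        if parameter is None:
            parameter = estimate_stereo_parameter(decoded)
        decoded.append(parameter)
    return decoded
```

A decoder would then use the returned value in place of the lost parameter when generating the output channel; that synthesis step is not sketched here.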
-
Patent number: 11815687
Abstract: A method performed by a head-mounted device can include, based on a front-facing camera included in the head-mounted device capturing an image of a wearable device, configuring the head-mounted device to receive input via the wearable device, determining that a gesture received by the wearable device includes a request to launch an application, and, in response to determining that the gesture includes the request to launch the application, launching the application.
Type: Grant
Filed: March 2, 2022
Date of Patent: November 14, 2023
Assignee: GOOGLE LLC
Inventors: Dongeek Shin, Isaac Allen Fehr, Sean Kyungmok Bae, Ding Xu
-
Patent number: 11798555
Abstract: A system of reducing transmissions of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify candidate interfaces and determine if prior instances of the packetized data was transmitted to the candidate interfaces. The interface management component can prevent the transmission of the packetized data if determined to be redundant, such as having previously received the data, and instead transmit it to a separate client device of a different device type.
Type: Grant
Filed: August 3, 2021
Date of Patent: October 24, 2023
Assignee: GOOGLE LLC
Inventors: Gaurav Bhaya, Tarun Jain, Anshul Kothari
-
Patent number: 11790891
Abstract: Generally discussed herein are devices, systems, and methods for custom wake word selection assistance. A method can include receiving, at a device, data indicating a custom wake word provided by a user, determining one or more characteristics of the custom wake word, determining that use of the custom wake word will cause more than a threshold rate of false detections based on the characteristics, rejecting the custom wake word as the wake word for accessing a personal assistant in response to determining that use of the custom wake word will cause more than a threshold rate of false detections, and setting the custom wake word as the wake word in response to determining that use of the custom wake word will not cause more than the threshold rate of false detections.
Type: Grant
Filed: December 1, 2021
Date of Patent: October 17, 2023
Assignee: Microsoft Technology Licensing, LLC
Inventors: Emilian Stoimenov, Khuram Shahid, Guoli Ye, Hosam Adel Khalil, Yifan Gong
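The accept-or-reject decision in this abstract reduces to comparing an estimated false-detection rate against a threshold. The sketch below shows that control flow only. The characteristics used (letter count and number of vowel groups), the formula that turns them into a rate, and the default threshold are all made-up placeholders, since the abstract does not say how the rate is determined.

```python
VOWELS = set("aeiouy")


def characteristics(word):
    """Extract two toy characteristics: letter count and number of vowel groups."""
    letters = [c for c in word.lower() if c.isalpha()]
    vowel_groups = 0
    previous_was_vowel = False
    for c in letters:
        is_vowel = c in VOWELS
        if is_vowel and not previous_was_vowel:
            vowel_groups += 1
        previous_was_vowel = is_vowel
    return {"length": len(letters), "vowel_groups": vowel_groups}


def estimated_false_detection_rate(traits):
    """Hypothetical heuristic: short words with few vowel groups trigger more often."""
    return 1.0 / (1 + traits["length"] * traits["vowel_groups"])


def select_wake_word(candidate, current, threshold=0.05):
    """Set the candidate as the wake word unless its estimated rate exceeds the threshold."""
    rate = estimated_false_detection_rate(characteristics(candidate))
    return candidate if rate <= threshold else current
```

Under these toy numbers a very short candidate such as "hi" is rejected and the current wake word is kept, while a longer candidate such as "computer" is accepted.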
-
Patent number: 11756532
Abstract: An intelligence-driven virtual assistant for automated documentation of new ideas is provided. During a brainstorming session, one or more user participants may discuss and identify one or more ideas. Such ideas may be tracked, catalogued, analyzed, developed, and further expanded upon through use of an intelligence-driven virtual assistant. Such virtual assistant may capture user input data embodying one or more new ideas and intelligently process the same in accordance with creativity tool workflows. Such workflows may further guide development and expansion upon a given idea, while continuing to document, analyze, and identify further aspects to develop and expand.
Type: Grant
Filed: November 29, 2021
Date of Patent: September 12, 2023
Assignee: BRIGHT MARBLES, INC.
Inventors: John Cronin, Burt Cummings, Charles Root, Michael D'Andrea, Jeffrey Goodwin, Nagesh Kadaba
-
Patent number: 11748722
Abstract: Embodiments of the invention provide a method, system and computer program product for online ordering using conversational interfaces. In an embodiment of the invention, the method includes storing customer information corresponding to a customer and responsive to receiving a message with text or speech and an image from the customer, identifying an intent type from the text or speech using Natural Language Understanding, identifying a product or service from the image using image classification techniques and transmitting a product detail message to the customer with the product or service and corresponding pricing using Natural Language Generation. The method further includes responsive to receiving an affirmative message from the customer in response to the product detail message identified as affirmative using Natural Language Understanding, automatically completing a purchase of the product or service with the customer information and transmitting a receipt message to the customer with an order receipt.
Type: Grant
Filed: April 21, 2021
Date of Patent: September 5, 2023
Assignee: WIZARD COMMERCE, INC.
Inventor: Melissa Bridgeford
-
Patent number: 11740861
Abstract: In one embodiment, a computer-implemented method for editing navigation of a content item is disclosed. The method may include presenting, via a user interface at a client computing device, time-synchronized text pertaining to the content item; receiving an input of a tag for the time-synchronized text of the content item, wherein the tag corresponds to a performer that performs at least a portion of the content item at a timestamp in the time-synchronized text; storing the tag associated with the portion of the content item at the timestamp in the time-synchronized text of the content item; and responsive to receiving a request to play the content item: playing the content item via a media player presented in the user interface, and concurrently presenting the time-synchronized text and the tag in the user interface, wherein the tag is presented as a graphical user element in the user interface.
Type: Grant
Filed: September 15, 2022
Date of Patent: August 29, 2023
Assignee: Musixmatch S.P.A.
Inventors: Marco Paglia, Paolo Spazzini, Pierpaolo Di Panfilo, Niche Chathong, Daria Babeo
-
Patent number: 11741985
Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the original input narrowband audio signal. Other embodiments are disclosed.
Type: Grant
Filed: July 25, 2022
Date of Patent: August 29, 2023
Assignee: Staton Techiya LLC
Inventors: John Usher, Dan Ellis
-
Patent number: 11721093
Abstract: In one embodiment, a method includes, by a client system, receiving, by an assistant xbot of the client system, a request from a first user for a summary of user content from a first content source, retrieving, from the first content source, a plurality of content items corresponding to the request, generating a personalized summary of the retrieved content items, wherein the personalization of the summary is based on a user profile of the first user, and presenting, by the assistant xbot, the personalized summary responsive to the request within a separate communication interface between the assistant xbot and the first user, wherein the personalized summary is interactable by the first user to react to one or more of the plurality of content items.
Type: Grant
Filed: March 16, 2021
Date of Patent: August 8, 2023
Assignee: Meta Platforms, Inc.
Inventors: Xiaohu Liu, Baiyang Liu, Rajen Subba, Benoit F. Dumoulin
-
Patent number: 11693903
Abstract: An information processing device according to embodiments includes a reception unit that receives sensor data related to a user's action, a storage unit that stores a condition indicated by a threshold based on a psychological burden for information that is associated with the user's action and is to be presented to the user and for presentation of the information associated with the action related to the user or a threshold indicated by a psychological burden function, and a determination unit that determines content and a presentation timing of the information that is associated with the action indicated by the sensor data and is to be presented to the user from information stored in the storage unit when the action indicated by the sensor data received by the reception unit satisfies the condition related to the user and stored in the storage unit.
Type: Grant
Filed: April 3, 2019
Date of Patent: July 4, 2023
Assignee: Nippon Telegraph and Telephone Corporation
Inventors: Reiko Aruga, Tadashi Nunobiki
-
Patent number: 11681364
Abstract: An image processing system may receive image data from a camera of a user device and perform gaze prediction processing of the image data to predict one or more gaze patterns. The gaze prediction processing may include processing the image data using a neural network to detect faces and/or objects and generate an image feature map. The gaze prediction processing may include performing gaze direction prediction operations using the feature map and detected faces and/or objects to determine gaze direction probability data. The gaze prediction processing may include predicting a gaze pattern based on the gaze direction probability data and the image feature map. The gaze pattern may be short-term (e.g., atomic-level) or long-term (e.g., event-level).
Type: Grant
Filed: June 29, 2021
Date of Patent: June 20, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Xu Zhang, Yue Wu, Varsha Hedau, Shih-Fu Chang, Pradeep Natarajan
-
Patent number: 11669860
Abstract: Methods, systems, and media for automated compliance determination of content items are provided.
Type: Grant
Filed: December 11, 2019
Date of Patent: June 6, 2023
Assignee: Google LLC
Inventors: Henry Scott-Green, Michael de Ridder, T. J. Gaffney, Brian Mulford, Amund Tveit, Antoine Delaite, Preethi Puducheri Sundar
-
Patent number: 11657408
Abstract: Arrangements for synchronously tracking and controlling events across multiple computer systems are provided. In some examples, a user may register with a system and user data may be received. In some arrangements, historical data associated with the user may also be received. Machine learning may be used to analyze the historical data and/or user data and a first recommendation for an item may be generated and transmitted to the user. Upon receiving acceptance of the recommendation, the system may request data from one or more entities. For instance, entity data associated with current inventory, availability of items, layout of locations, and the like, may be received. Based on the received data, a list of items for capture and/or an item capture route may be generated. In some examples, the item capture route may include step-by-step or map-based instructions to capture the items on the list.
Type: Grant
Filed: January 7, 2020
Date of Patent: May 23, 2023
Assignee: Bank of America Corporation
Inventors: Manu Kurian, Matthew E. Carroll
-
Patent number: 11658928
Abstract: A virtual content creation method according to an embodiment of the present invention includes, by a server, receiving a model content including at least one of a text, an SMS, a voice-recorded MP3 file, a picture, and a video of a model; by the server, extracting a model feature including at least one of a text feature, a voice feature, an image feature, and a video feature from the model content; and when a user wants to communicate with the model, by the server, being operated based on deep learning or artificial intelligence to allow the user to input a user content to the server, determine a user state by detecting an emotional state of the user from the user content, and transform the model content into the virtual content using the model feature or the user state.
Type: Grant
Filed: August 7, 2020
Date of Patent: May 23, 2023
Inventor: Kab Cheon Choe
-
Patent number: 11580999
Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
Type: Grant
Filed: May 26, 2021
Date of Patent: February 14, 2023
Assignee: Electronics and Telecommunications Research Institute
Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang
-
Patent number: 11540003
Abstract: Methods, apparatus, and systems are disclosed for synchronizing streaming media content. An example apparatus includes a storage device, and a processor to execute instructions to identify a first source streaming broadcast media to a first computing device based on an audio fingerprint of audio associated with the broadcast media, identify sources broadcasting the broadcast media streaming to the first computing device, the sources available to a second computing device including the processor, select a second source of the identified sources for streaming the broadcast media to the second computing device, the second source different than the first source, detect termination of the streaming of the broadcast media on the first computing device, the termination corresponding to a termination time of the broadcast media, and automatically start, by using the selected second source, streaming of the broadcast media to the second computing device at the termination time.
Type: Grant
Filed: March 18, 2021
Date of Patent: December 27, 2022
Assignee: Gracenote, Inc.
Inventors: Suresh Jeyachandran, Roger Tsai, Paul Emmanuel Quinn, Markus K. Cremer
-
Patent number: 11514332
Abstract: A method, computer program product, and system for a cognitive dialoguing avatar, the method including identifying a user, a target entity, and a user goal, initiating communication with the target entity, evaluating cognitively a question from a dialog with the target entity, determining cognitively an answer to the question by evaluating stored user information to progress to the user goal, communicating the determined answer to the target entity.
Type: Grant
Filed: March 26, 2018
Date of Patent: November 29, 2022
Assignee: International Business Machines Corporation
Inventors: Adam T. Clark, Nathaniel D. Lee, Daniel J. Strauss
-
Patent number: 11487832
Abstract: Implementations are described herein for analyzing existing interactive web sites to facilitate automatic engagement with those web sites, e.g., by automated assistants or via other user interfaces, with minimal effort from the hosts of those websites. For example, in various implementations, techniques described herein may be used to abstract, validate, maintain, generalize, extend and/or distribute individual actions and “traces” of actions that are useable to navigate through various interactive websites. Additionally, techniques are described herein for leveraging these actions and/or traces to automate aspects of interaction with a third party website.
Type: Grant
Filed: May 9, 2019
Date of Patent: November 1, 2022
Assignee: GOOGLE LLC
Inventors: Gökhan Bakir, Andre Elisseeff, Torsten Marek, João Paulo Pagaime da Silva, Mathias Carlen, Dana Ritter, Lukasz Suder, Ernest Galbrun, Matthew Stokes, Marcin Nowak-Przygodzki, Mugurel-Ionut Andreica, Marius Dumitran
-
Patent number: 11488281
Abstract: A multichannel interpolator has an input that receives input data that consists of interleaved channel data from a plurality of data channels. A block random access memory (BRAM) stores data samples from the input data received from the input. Input control logic receives the data samples from the input and places the data samples into the BRAM. Interpolator logic interpolates the data samples to produce output data. The output data is interpolated at an interpolation ratio programmed by a user. The interpolator logic includes a phase generator that calculates a value indicating the interpolation ratio, and a fractional template block that provides a fractional template used to interpolate the data samples to produce the output data, the fraction template block selecting, based on the value calculated by the phase generator. The fractional template is used to interpolate the data samples to produce the output data.
Type: Grant
Filed: February 8, 2021
Date of Patent: November 1, 2022
Assignee: Keysight Technologies, Inc.
Inventor: Garrett Foltz
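A software analogy may help picture the roles of the phase generator and the fractional template in this abstract. In the sketch below, interleaved input is split per channel, a phase counter cycles through `ratio` positions between input samples, and each phase selects a pair of weights from a precomputed table. The table holds linear-interpolation weights, which is an assumption made for simplicity; the patent describes hardware and does not state which templates are used. The ratio is assumed to be a positive integer.

```python
def deinterleave(data, channels):
    """Split interleaved samples [c0, c1, ..., c0, c1, ...] into per-channel lists."""
    return [data[c::channels] for c in range(channels)]


def linear_templates(ratio):
    """One (w0, w1) weight pair per fractional phase; linear weights are an assumption."""
    return [(1 - p / ratio, p / ratio) for p in range(ratio)]


def interpolate(samples, ratio):
    """Upsample one channel by an integer ratio using phase-selected weight pairs."""
    templates = linear_templates(ratio)
    out = []
    for n in range((len(samples) - 1) * ratio + 1):
        index, phase = divmod(n, ratio)  # phase plays the role of the phase generator output
        w0, w1 = templates[phase]        # template selected by the current phase
        following = samples[index + 1] if phase else samples[index]
        out.append(w0 * samples[index] + w1 * following)
    return out
```

For a two-channel stream, `[interpolate(ch, 2) for ch in deinterleave(data, 2)]` doubles the sample rate of each channel independently. Buffering in block RAM and non-integer ratios are outside this sketch.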
-
Patent number: 11481185
Abstract: In one embodiment, a computer-implemented method for editing navigation of a content item is disclosed. The method may include presenting, via a user interface at a client computing device, time-synchronized text pertaining to the content item; receiving an input of a tag for the time-synchronized text of the content item, wherein the tag corresponds to a performer that performs at least a portion of the content item at a timestamp in the time-synchronized text; storing the tag associated with the portion of the content item at the timestamp in the time-synchronized text of the content item; and responsive to receiving a request to play the content item: playing the content item via a media player presented in the user interface, and concurrently presenting the time-synchronized text and the tag in the user interface, wherein the tag is presented as a graphical user element in the user interface.
Type: Grant
Filed: June 17, 2021
Date of Patent: October 25, 2022
Assignee: Musixmatch S.P.A.
Inventors: Marco Paglia, Paolo Spazzini, Pierpaolo Di Panfilo, Niche Chathong, Daria Babco
-
Patent number: 11474841Abstract: Methods, apparatus, systems, and computer-readable media are provided for providing context specific schema files that allow an automated assistant to broker human-to-computer dialogs between a user and an application that is separate from the automated assistant. The context specific schema file can provide the automated assistant with sufficient data to be responsive to user queries without necessarily communicating with a remote device, such as a server. Multiple different context specific schema files can be made available to the automated assistant according to a context in which a user is interacting with the automated assistant. In this way, latency otherwise exhibited by the automated assistant can be mitigated by providing the automated assistant with the information needed to respond to a user without continually retrieving the information over a network.Type: GrantFiled: January 23, 2019Date of Patent: October 18, 2022Assignee: GOOGLE LLCInventors: Justin Lewis, Scott Davies
-
Patent number: 11461708Abstract: There is provided a controller that executes: calculating, based on information about behavior of a user, likelihood about future movement of the user; and outputting information about a service for the movement of the user based on the calculated likelihood.Type: GrantFiled: December 23, 2020Date of Patent: October 4, 2022Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHAInventors: Yurika Tanaka, Daiki Kaneichi
-
Patent number: 11444839Abstract: System for optimizing bandwidth during an online meeting comprises a plurality of data processing systems, wherein each of the data processing systems is associated with a user and comprises a processor, and a memory comprising a digital client and a digital client display interface, wherein the processor causes the digital client to publish an audio-visual stream comprising a video component and an audio component from the corresponding data processing system. A first data processing system, among the plurality of data processing systems is configured to receive an instruction to optimize the bandwidth by limiting the number of data processing systems from which an audio-visual stream is to be played in the first digital client display interface. Further, the first data processing system may play, in the first digital client display interface, an audio-visual stream from each of the number of data processing systems as instructed by the first user.Type: GrantFiled: May 27, 2021Date of Patent: September 13, 2022Inventor: Kishore Daggubati
-
Patent number: 11437030Abstract: Selectively performing voice recognition using one device among multiple devices that recognize and execute the voice recognition based on at least one of apparatus information of the multiple devices and a function parsed from a result of the voice recognition. Thereby, only a single preferable device in an environment in which multiple devices exist, which can service the user input via voice recognition, actually responds to the voice input and services the voice input of the user.Type: GrantFiled: October 17, 2018Date of Patent: September 6, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Chan-hee Choi
-
Patent number: 9043733Abstract: In one example, a method includes receiving an indication of an input gesture detected at a presence-sensitive input device, where the input gesture includes one or more input points and each input point is detected at a respective location of the presence-sensitive input device. The method may also include determining a focal point of the input gesture, and determining a radius length. The method may also include determining a shape centered at the focal point and having a size determined based on the radius length. The method may also include responding to a change in a geometric property of the shape by scaling information included in a graphical user interface, where the scaling of the information is centered at the focal point.Type: GrantFiled: March 15, 2013Date of Patent: May 26, 2015Assignee: Google Inc.Inventors: Winson Chung, Adam William Powell, Svetoslav Ganov, Michael Adam Cohen
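The geometry described here can be sketched in a few lines. The abstract does not say how the focal point or the radius length is computed, so this sketch assumes the centroid of the input points and their mean distance from it; the shape is taken to be a circle, and every function name is hypothetical.

```python
import math


def focal_point(points):
    """Assumed focal point: the centroid of the input points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)


def radius_length(points, focal):
    """Assumed radius: the mean distance from the focal point to each input point."""
    return sum(math.dist(p, focal) for p in points) / len(points)


def scale_factor(old_points, new_points):
    """Ratio of the new circle's radius to the old one as the gesture changes."""
    r_old = radius_length(old_points, focal_point(old_points))
    r_new = radius_length(new_points, focal_point(new_points))
    return r_new / r_old


def scale_about(point, focal, factor):
    """Scale a UI coordinate so that the focal point itself stays fixed."""
    return (focal[0] + (point[0] - focal[0]) * factor,
            focal[1] + (point[1] - focal[1]) * factor)
```

With two fingers moving from `(0, 0)` and `(2, 0)` to `(-1, 0)` and `(3, 0)`, the focal point stays at `(1, 0)` while the radius doubles, so content is scaled by a factor of 2 around `(1, 0)`.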
-
Patent number: 8994522Abstract: The described method and system provide for HMI steering for a telematics-equipped vehicle based on likelihood to exceed eye glance guidelines. By determining whether a task is likely to cause the user to exceed eye glance guidelines, alternative HMI processes may be presented to a user to reduce ASGT and EORT and increase compliance with eye glance guidelines. By allowing a user to navigate through long lists of items through vocal input, T9 text input, or heuristic processing rather than through conventional presentation of the full list, a user is much more likely to comply with the eye glance guidelines. This invention is particularly useful in contexts where users may be searching for one item out of a plurality of potential items, for example, within the context of hands-free calling contacts, playing back audio files, or finding points of interest during GPS navigation.Type: GrantFiled: May 26, 2011Date of Patent: March 31, 2015Assignees: General Motors LLC, GM Global Technology Operations LLCInventors: Steven C. Tengler, Bijaya Aryal, Scott P. Geisler, Michael A. Wuergler
-
Patent number: 8749405Abstract: An object of the invention is also a navigation system having an input device for the input of an input scale value, having a display device for displaying road map information according to a selected display scale value and having a processor device, wherein the number of enterable input scale values is larger than the number of the selectable display scale values.Type: GrantFiled: March 9, 2012Date of Patent: June 10, 2014Assignee: Bayerische Motoren Werke AktiengesellschaftInventors: Karsten Knebel, Liza Hassel, Frank Wolf
-
Publication number: 20140129231Abstract: A computer program product comprises computer usable program code for receiving data describing a proposed electronic transaction between first and second communications devices. Additional computer usable program code is provided for generating a first audio signal by sound detected by a first microphone of the first communications device, and for generating a second audio signal by sound detected by a second microphone that is part of the second communications device. Still further computer usable program code provides for authenticating that the first communications device and the second communications device are in the same proximity in response to determining that the first and second audio signals were produced by the same sound event, and for completing the proposed electronic transaction between the first and second communications device in response to authenticating that the first and second communications devices are in close proximity.Type: ApplicationFiled: November 2, 2012Publication date: May 8, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Dean F. Herring, Ethan G. Holder, Brad M. Johnson, III, Adrian X. Rodriguez, Jeffrey J. Smith
-
Publication number: 20140122085Abstract: Embodiments of the present general inventive concept provide a voice controlled vibration data analyzer system, including a vibration sensor to detect vibration data from a machine-under-test, a data acquisition unit to receive the vibration data from the vibration sensor, and a control unit having a user interface to receive manual and audio input from a user, and to communicate information relating to the machine-under-test, the control unit executing commands in response to the manual or audio input to control the data acquisition unit and/or user interface to output an audio or visual message relating to a navigation path of multiple machines to be tested, to collect and process the vibration data, and to receive manual or audio physical observations from the user to characterize collected vibration data.Type: ApplicationFiled: October 26, 2012Publication date: May 1, 2014Applicant: Azima Holdings, Inc.Inventors: Kenneth Ralph Piety, K. C. Dahl
-
Publication number: 20140114664Abstract: Embodiments of methods and systems for dominant speaker identification in video conferencing are described. In one embodiment, the computer-implemented method includes identifying one or more dominant speakers in a video conference. The method may also include generating a list of the one or more dominant speakers. Additionally, the method may include communicating the list of one or more dominant speakers to clients in a video conferencing system. In a further embodiment, the method includes communicating the list of the one or more dominant speakers to a client in response to the client joining the video conference.Type: ApplicationFiled: October 20, 2012Publication date: April 24, 2014Applicant: MICROSOFT CORPORATIONInventors: Humayun M. Khan, Jiannan Zheng, Timothy M. Moore
-
Publication number: 20140100853Abstract: An interactive voice response system, comprising: a processor configured to control the output of voice prompts for transmission to a user; an alphanumeric string generator controllable by the processor to generate a random or pseudo-random alphanumeric string for outputting by the processor to a user in natural language form; an input module for receiving a user response and configured to recognize alphanumeric characters in the user response and to output a recognized string of one or more alphanumeric characters recognized in the user response; and a validation module.Type: ApplicationFiled: October 5, 2012Publication date: April 10, 2014Applicant: TOUCH NETWORKS PTY LTDInventor: Jason Andrew Van
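A rough sketch of the challenge-and-response flow follows. The abstract names a validation module without describing its behaviour, so comparing the recognized string against the generated one is an assumption, as are the character set, the default length, and the function names; speech synthesis and recognition are out of scope here.

```python
import secrets
import string

# Assumed character set for the spoken challenge.
ALPHABET = string.ascii_uppercase + string.digits


def generate_challenge(length=6):
    """Produce a random alphanumeric string to be read out to the caller."""
    return "".join(secrets.choice(ALPHABET) for _ in range(length))


def normalize(recognized):
    """Keep only recognized alphanumeric characters, upper-cased."""
    return "".join(ch for ch in recognized.upper() if ch in ALPHABET)


def validate(challenge, recognized):
    """Assumed validation step: the recognized string must match the challenge."""
    return secrets.compare_digest(challenge, normalize(recognized))
```

A recognizer output such as `"a1 b2-c3"` normalizes to `"A1B2C3"`, so it would validate against the challenge `"A1B2C3"`.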
-
Publication number: 20140098233Abstract: An access control reader enhances audio data captured by a beamforming microphone array. The access control reader determines a direction to a user and then utilizes beamforming in the direction of the user to enhance the user's voice. The user's enhanced voice is then transmitted to security personnel or a control system to validate the user's identity, in one example.Type: ApplicationFiled: October 5, 2012Publication date: April 10, 2014Applicant: SENSORMATIC ELECTRONICS, LLCInventors: Walter A. Martin, Martin J. Donaghy
-
Publication number: 20140095153Abstract: Methods and apparatus to provide speech privacy are disclosed. An example method includes forming a sampling block based on a first received audio sample, the sampling block representing speech of a user, creating, with a processor, a mask based on the sampling block, the mask to reduce the intelligibility of the speech of the user, wherein the mask is created by converting the sampling block from a time domain to a frequency domain to form a frequency domain sampling block, identifying a first peak within the frequency domain sampling block, demodulating the frequency domain sampling block at the first peak to form a first envelope of the sampling block, distorting the first envelope to form a first distorted envelope, and emitting an acoustic representation of the mask via a speaker.Type: ApplicationFiled: September 28, 2012Publication date: April 3, 2014Inventor: Rafael de la Guardia Gonzales
-
Publication number: 20140095166Abstract: In a method for deep tagging a recording, a computer records audio comprising speech from one or more people. The computer detects a non-speech sound within the audio. The computer determines that the non-speech sound corresponds to a type of sound, and in response, associates a descriptive term with a time of occurrence of the non-speech sound within the recorded audio to form a searchable tag. The computer stores the searchable tag as metadata of the recorded audio.Type: ApplicationFiled: September 28, 2012Publication date: April 3, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Denise A. Bell, Lisa Seacat DeLuca, Jana H. Jenkins, Jeffrey A. Kusnitz
-
Publication number: 20140081643Abstract: Systems, methods, and non-transitory computer-readable storage media for determining expertise through speech analytics. The system associates speakers with respective segments of an audio conversation to yield associated speaker segments. The system also identifies a number of times a speaker has spoken about a topic in the audio conversation by searching the associated speaker segments for a term associated with the topic. The system then ranks the speaker as an expert in the topic when the number of times the speaker has spoken about the topic in the audio conversation exceeds a threshold. The audio conversation can include a compilation of a plurality of audio conversations. Moreover, the system can tag the associated speaker segments having the term with keyword tags and match a respective segment from the associated speaker segments with the speaker, the respective segment having a keyword tag.Type: ApplicationFiled: September 14, 2012Publication date: March 20, 2014Applicant: Avaya Inc.Inventors: Ajita JOHN, Michael J. SAMMON, Reinhard KLEMM, Doree Duncan SELIGMANN
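The counting and threshold steps can be sketched as below, assuming speaker diarization and transcription have already produced `(speaker, text)` segments. The substring match, the data layout, and the function names are illustrative assumptions rather than the method the application claims.

```python
from collections import Counter


def count_topic_mentions(segments, term):
    """Count, per speaker, the segments whose text contains the topic term."""
    counts = Counter()
    needle = term.lower()
    for speaker, text in segments:
        if needle in text.lower():
            counts[speaker] += 1
    return counts


def find_experts(segments, term, threshold):
    """Return speakers whose mention count exceeds the threshold, sorted by name."""
    counts = count_topic_mentions(segments, term)
    return sorted(speaker for speaker, n in counts.items() if n > threshold)


segments = [
    ("ana", "Kubernetes pods keep restarting"),
    ("ben", "Shall we break for lunch?"),
    ("ana", "The kubernetes scheduler is the bottleneck"),
    ("ben", "Kubernetes is still new to me"),
    ("ana", "Check the Kubernetes logs first"),
]
print(find_experts(segments, "kubernetes", 2))  # prints ['ana']
```

Here `ana` mentions the term in three segments and `ben` in one, so only `ana` exceeds a threshold of 2.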