Miscellaneous Analysis Or Detection Of Speech Characteristics (epo) Patents (Class 704/E11.001)
-
Patent number: 12190061Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.Type: GrantFiled: December 17, 2021Date of Patent: January 7, 2025Assignee: ADOBE INC.Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
-
Method and system for controlling distributions of attributes in language models for text generation
Patent number: 12153896Abstract: A method for generating a language model for text generation by receiving a pre-trained language model having attributes with existing probability distributions over the pre-trained language model; receiving at least one target constraint; the target constraint specifying an expectation of a target attribute over a language model that approximates the pre-trained language model; computing parameters of an energy based model by applying the target constraint to the pre-trained language model; obtaining samples from a reference policy; updating parameters of a target policy using the obtained samples and the energy based model; updating the reference policy with the target policy if the target policy is superior to the reference policy; and outputting the target policy as a target language model. The target language model is adapted to generate text with the target attribute over a probability distribution that approximates the desired probability distribution specified by the target constraint.Type: GrantFiled: August 2, 2021Date of Patent: November 26, 2024Assignee: Naver CorporationInventors: Marc Dymetman, Hady Elsahar, Muhammad Khalifa -
Patent number: 12149761Abstract: A system and method uses a first device fingerprint for a set-top box (STB) installed within a home theater environment which includes an over-the-top (OTT) device to cause a one a plurality of original equipment manufacturer (OEM) remote control setup procedures to be selected for use to configure an OEM remote control for the STB and the selected one of the plurality of OEM remote control setup procedures uses a second device fingerprint for the OTT device to cause the OEM remote control to be configured to transmit one or more commands to control functional operations of the OTT device.Type: GrantFiled: July 19, 2022Date of Patent: November 19, 2024Assignee: UNIVERSAL ELECTRONICS INC.Inventor: Paul D. Arling
-
Patent number: 12125893Abstract: Describe is a resonator that uses anti-ferroelectric (AFE) materials in the gate of a transistor as a dielectric. The use of AFE increases the strain/stress generated in the gate of the FinFET. Along with the usual capacitive drive, which is boosted with the increased polarization, additional current drive is also achieved from the piezoelectric response generated to due to AFE material. In some embodiments, the acoustic mode of the resonator is isolated using phononic gratings all around the resonator using the metal line above and vias' to body and dummy fins on the side. As such, a Bragg reflector is formed above or below the AFE based transistor. Increased drive signal from the AFE results in larger output signal and larger bandwidth.Type: GrantFiled: April 3, 2023Date of Patent: October 22, 2024Assignee: Intel CorporationInventors: Tanay Gosavi, Chia-Ching Lin, Raseong Kim, Ashish Verma Penumatcha, Uygar Avci, Ian Young
-
Patent number: 12120182Abstract: Systems and methods for modulating content to effect state change are described. A state control system initiates a process for modulating output objects to effect one or more changes in a state profile associated with a user device. The system queries for historical data associated with the user device; determines whether any historical data is identified for user device and in response to determining that historical data is found predicts a current state profile associated with the user device. The system further collects real-time sensor data associated with user device; filters and normalizes the sensor data; and delivers a plurality of output objects to the user device or secondary device(s) based on real-time sensor data.Type: GrantFiled: July 7, 2022Date of Patent: October 15, 2024Assignee: Daily Rays Inc.Inventors: Ashley Saye, Diego I. Medina-Bernal, Jonathon Nostrant
-
Patent number: 12099599Abstract: Apparatuses and methods for determining if a computer program is malware and to which malware class it belongs to. In the method, the behaviour of a computer program is traced by observing the activity of the program. Behaviour sequences comprising API-calls or similar activity of a computer program are then provided into a classifier for classifying the computer program. From the outcome of the classifier, a classification result and the portions relevant to decision can be provided to a person for further confirmation.Type: GrantFiled: November 30, 2021Date of Patent: September 24, 2024Assignee: Huawei Technologies Co., Ltd.Inventors: Moez Draief, Xiang Chen, Konstantin Kutzkov, Kevin Scaman, Milan Vojnovic
-
Patent number: 12062361Abstract: A voice-activated system edge device cooperating with a remote command processor has a state machine defined by a listening mode state and a conversation monitoring mode state. The state machine transitions from the listening mode state to the conversation monitoring mode state in response to a wake word detection. A command accompanying the wake word is transmitted to the remote command processor for execution thereon. The conversation monitoring mode state is maintained for a conversation monitoring window time duration to receive a connection word accompanied by another command transmitted to the remote command processor for further execution thereon.Type: GrantFiled: November 2, 2021Date of Patent: August 13, 2024Assignee: AONDEVICES, INC.Inventors: Mouna Elkhatib, Adil Benyassine, Aruna Vittal, Eli Uc, Daniel Schoch
-
Patent number: 12057135Abstract: This application discloses a speech noise reduction method performed by a computing device. The method includes: obtaining a noisy speech signal, the noisy speech signal including a pure speech signal and a noise signal; estimating a posteriori signal-to-noise ratio and a priori signal-to-noise ratio of the noisy speech signal; determining a speech/noise likelihood ratio in a Bark domain based on the estimated posteriori signal-to-noise ratio and the estimated priori signal-to-noise ratio; estimating a priori speech existence probability based on the determined speech/noise likelihood ratio; determining a gain based on the estimated posteriori signal-to-noise ratio, the estimated priori signal-to-noise ratio, and the estimated priori speech existence probability, the gain being a frequency domain transfer function used for converting the noisy speech signal into an estimation of the pure speech signal; and exporting the estimation of the pure speech signal from the noisy speech signal based on the gain.Type: GrantFiled: April 9, 2021Date of Patent: August 6, 2024Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventors: Xuan Ji, Meng Yu
-
Patent number: 12014649Abstract: System and method for personalized book recommendations have been designed to align with a user's current vocabulary and facilitate sustainable vocabulary growth. By analyzing the user's vocabulary size and growth trajectory, the system can recommend books that are appropriately challenging yet comprehensible. This approach enables users to explore books of their choice more effectively, even before their vocabulary reaches a level sufficient for understanding any book in general. In addition, a vocabulary-based flashcard generator assistant system has been incorporated. This assistant system enables the user to rapidly create high-quality, personalized flashcards for optimal vocabulary acquisition. The flashcards contain content that is tailored to the individual's learning style and pace, further promoting sustainable and effective vocabulary growth.Type: GrantFiled: August 3, 2022Date of Patent: June 18, 2024Inventor: Dzmitry Kushal
-
Patent number: 12002462Abstract: A method for programming a stimulation device of a stimulation system using a programming device includes providing a set of programming commands for the programming device that include a first programming command increasing a stimulation amplitude and a second programming command includes decreasing the stimulation amplitude; receiving a verbal communication by a voice command handler of the programming device or in communication with the programming device; determining whether the verbal communication is a trigger word and, when the verbal communication is the trigger word, entering a triggered state, wherein, after entering the triggered state, the programming device remains in the triggered state until a one of at least one stop condition is met; and, when in the triggered state, determining whether the verbal communication is one of the programming commands and, when the verbal communication is one of the programming commands, executing the one of the programming commands.Type: GrantFiled: October 28, 2021Date of Patent: June 4, 2024Assignee: Boston Scientific Neuromodulation CorporationInventors: Jimmy Lee Chao, Vishal Jagannathan, Eugene Mesina, Travis McCoy
-
Patent number: 11977815Abstract: A dialogue processing method and device are provided. The method includes: A dialogue processing device receives dialogue information from user equipment; if the dialogue information does not include slot information that is corresponding to a first slot type and that can determine a service, the dialogue processing device obtains a service identifier set corresponding to the first slot type from a server, and sends the service identifier set to the user equipment; and after a target service identifier is received from the user equipment, the dialogue processing device requests a service corresponding to the target service identifier from the server, and sends execution success information to the user equipment. According to this method, a service item can be presented to a user in a timely manner, and the user can be prevented from initiating a plurality of rounds of dialogues with the dialogue processing device, thereby improving service execution efficiency and further improving use experience of the user.Type: GrantFiled: July 9, 2021Date of Patent: May 7, 2024Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Hongfeng Luo, Yibo Zhang
-
Patent number: 11966663Abstract: Techniques for performing speech processing using multi-modal widget information are described. A system may receive input data corresponding to a user input. The system may also receive widget context data corresponding to one or more multi-modal widgets active at a device. The system may use the widget context data to perform natural language understanding (NLU) processing with respect to the user input, and for selecting a skill component for responding to the user input. The system may send a widget identifier to the skill component when invoking the skill to respond to the user input.Type: GrantFiled: September 29, 2021Date of Patent: April 23, 2024Assignee: Amazon Technologies, Inc.Inventors: Nhat Vu Doan, Nicholas Adam Cummings, Prashant Jayaram Thakare, Jalaj Kumar, Ganesh Prabu Ravi, Chih-Shin Wang, Narenda Gyanchandani
-
Patent number: 11947869Abstract: Provided is an information processing device, an information processing method, and a program, the information processing device including a control unit that dynamically controls output of notification information related to a function corresponding to a gesture regarding function execution of the device based on a recognition status of an operation body that is executing the gesture in a predetermined operation region.Type: GrantFiled: March 23, 2020Date of Patent: April 2, 2024Assignee: SONY GROUP CORPORATIONInventors: Kei Takahashi, Junichi Shimizu, Junichi Nagahara, Manabu Fujiki, Tomohiro Imura, Keiichi Kitahara
-
Patent number: 11947908Abstract: Described herein are system and method embodiments to improve word representation learning. Embodiments of a probabilistic prior may seamlessly integrate statistical disentanglement with word embedding. Different from previous deterministic methods, word embedding may be taken as a probabilistic generative model, and it enables imposing a prior that may identify independent factors generating word representation vectors. The probabilistic prior not only enhances the representation of word embedding, but also improves the model's robustness and stability. Furthermore, embodiments of the disclosed method may be flexibly plugged in various word embedding models. Extensive experimental results show that embodiments of the presented method may improve word representation on different tasks.Type: GrantFiled: April 7, 2021Date of Patent: April 2, 2024Assignee: Baidu USA LLCInventors: Shaogang Ren, Ping Li
-
Patent number: 11922968Abstract: A boundary of a highlight of audiovisual content depicting an event is identified. The audiovisual content may be a broadcast, such as a television broadcast of a sporting event. The highlight may be a segment of the audiovisual content deemed to be of particular interest. Audio data for the audiovisual content is stored, and the audio data is automatically analyzed to detect one or more audio events indicative of one or more occurrences to be included in the highlight. Each audio event may be a brief, high-energy audio burst such as the sound made by a tennis serve. A time index within the audiovisual content, before or after the audio event, may be designated as the boundary, which may be the beginning or end of the highlight.Type: GrantFiled: February 25, 2022Date of Patent: March 5, 2024Assignee: STATS LLCInventors: Mihailo Stojancic, Warren Packard
-
Patent number: 11921775Abstract: Media unit retrieval methods, systems and computer program products are provided that allow a user to search for an item by iteratively presenting media units such as images representing items to the user and receiving user input consisting of selections of the presented media units (including possibly the empty selection). Features, or attributes, a user is interested in, for example semantic features, are inferred from the interaction and media units are retrieved for presentation based on similarity with user-selected media units, through sampling of a probability distribution describing the intent or interests, or combinations of approaches. Accordingly, the user-experience is akin to a conversation about what the user is looking for. Retrieval may be based on both selected and unselected media units and the selection may comprise making a selection with a single action. Further, a database of media units can capture similarity relationships for efficient media unit retrieval.Type: GrantFiled: December 9, 2022Date of Patent: March 5, 2024Assignee: DREAM IT GET IT LIMITEDInventors: Michael Elkaim, Michael Kopp, Kristjan Korjus
-
Patent number: 11907666Abstract: Various embodiments of a system and associated method for anonymization of text without losing semantic utility of text by extracting a latent embedding representation of content with respect to a given task and by learning an optimal strategy for text embedding manipulation to satisfy both privacy and utility requirements are disclosed herein. In particular, the system balances private attribute obfuscation with retained semantic utility.Type: GrantFiled: November 16, 2021Date of Patent: February 20, 2024Assignee: Arizona Board of Regents on Behalf of Arizona State UniversityInventors: Ahmadreza Mosallanezhad, Ghazaleh Beigi, Huan Liu
-
Patent number: 11887590Abstract: Methods and devices for enabling and disabling applications using voice are described herein. In some embodiments, an individual speak an utterance to their electronic device, which may send audio data representing the utterance to a backend system. The backend system may generate text data representing the utterance, and may determine that an intent of the utterance was for an application to be enabled or disabled for their user account on the backend system. If, for instance, the intent was to enable the application, the backend system may receive one or more rules for performing functionalities of the application, as well as one or more sample templates of sample utterances and sample responses that future utterances may use when requesting the application. Furthermore, one or more invocation phrases that may be used within the future utterances to invoke the application may be received, along with slot values for the sample templates.Type: GrantFiled: September 24, 2020Date of Patent: January 30, 2024Assignee: Amazon Technologies, Inc.Inventors: Shaman D'Souza, Ian Suttle, Srikanth Nori, Rajiv Reddy, Amol Kanitkar, Tina Orooji
-
Patent number: 11889142Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.Type: GrantFiled: December 9, 2022Date of Patent: January 30, 2024Assignee: Universal Electronics Inc.Inventors: Thomas Hascher, Menno Koopmans
-
Patent number: 11830380Abstract: Methods, systems and computer program products for automated learning are provided herein. A computer-implemented method includes authenticating a plurality of users for an automated learning session, wherein the plurality of users correspond to at least one device, and providing the automated learning session for the plurality of users. Providing the automated learning session comprises analyzing a plurality of learning models corresponding to one or more of the plurality of users, determining, based on the analysis, one or more activities to be performed by the plurality of users during the automated learning session, and executing the one or more activities on at least one device.Type: GrantFiled: January 10, 2019Date of Patent: November 28, 2023Assignee: International Business Machines CorporationInventors: Smitkumar Narotambhai Marvaniya, Tejas Indulal Dhamecha, Malolan Chetlur, Renuka Sindhgatta, Bikram Sengupta
-
Patent number: 11798579Abstract: A parameter included in a fundamental frequency pattern of a voice can be estimated from the fundamental frequency pattern with high accuracy and the fundamental frequency pattern of the voice can be reconstructed from the parameter included in the fundamental frequency pattern. A learning unit 30 learns a deep generation model including an encoder which regards a parameter included in a fundamental frequency pattern in a voice signal as a latent variable of the deep generation model and estimates the latent variable from the fundamental frequency pattern in the voice signal on the basis of parallel data of the fundamental frequency pattern in the voice signal and the parameter included in the fundamental frequency pattern in the voice signal, and a decoder which reconstructs the fundamental frequency pattern in the voice signal from the latent variable.Type: GrantFiled: February 19, 2019Date of Patent: October 24, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Ko Tanaka, Hirokazu Kameoka
-
Patent number: 11755847Abstract: Embodiments described herein provide adversarial attacks targeting the cross-lingual generalization ability of massive multilingual representations, demonstrating their effectiveness on multilingual models for natural language inference and question answering. An efficient adversarial training scheme can thus be implemented with the adversarial attacks, which takes the same number of steps as standard supervised training and show that it encourages language-invariance in representations, thereby improving both clean and robust accuracy.Type: GrantFiled: January 15, 2021Date of Patent: September 12, 2023Assignee: Salesforce, Inc.Inventors: Samson Min Rong Tan, Shafiq Rayhan Joty
-
Patent number: 11721328Abstract: The present invention discloses a method and apparatus for awakening skills by speech, which are applied to an electronic device. The method for awakening skills by speech includes: recognizing awakening text information corresponding to a speech request message to be processed; invoking a service skill semantic model to determine a target service field corresponding to the awakening text information and a corresponding first confidence, and invoking a knowledge skill semantic model to determine a knowledge reply answer corresponding to the awakening text information and a corresponding second confidence; and selecting to awaken one of a knowledge skill and a target service skill corresponding to the target service field based on the first confidence and the second confidence. Accordingly, the probability of erroneously awakening a skill based on the speech message can be reduced.Type: GrantFiled: October 26, 2020Date of Patent: August 8, 2023Assignee: AI SPEECH CO., LTD.Inventor: Chengya Zhu
-
Patent number: 11663415Abstract: The following relates generally to voice assisted healthcare. In some embodiments, a digital assistant receives audio data, and determines an intent from the audio data. The digital assistant may then match the determined intent to a flow of a set of flows, where the set of flows may include at least one of: (i) submitting a prescription, (ii) refilling a prescription, (iii) changing a pickup location, (iv) requesting a status update for a prescription, or (v) initiating a pharmacy chat session. The matched flow of the set of flows may then be executed.Type: GrantFiled: August 31, 2020Date of Patent: May 30, 2023Assignee: WALGREEN CO.Inventors: Julija Alegra Petkus, Andrew David Schweinfurth, Stephen Elijah Zambo
-
Patent number: 11638086Abstract: A method and an apparatus for enabling adaptive audio signal alteration are described. When an input audio signal is received, a determination of whether the user of an audio device hears the input audio signal is performed based upon brain activity of the user. A determination of whether the user is distracted by the audio signal is performed based upon sensor measurements indicating a physical state of the user. In response to determining that the user hears the input audio signal and that the input audio signal causes the user to be distracted, a determination of configuration parameter(s) is performed. An alteration of audio signal(s) is caused based upon the configuration parameter(s) to obtain modified version(s) of the audio signal(s) that are intended to address the distraction caused by the input audio signal, and output audio signals are output, where the output audio signals include the modified versions.Type: GrantFiled: June 29, 2022Date of Patent: April 25, 2023Assignee: Telefonaktiebolaget LM Ericsson (publ)Inventors: Matthew John Lawrenson, Jan Jasper Van Den Berg, Jacob Ström, Lars Andersson
-
Patent number: 11627189Abstract: Techniques for implementing a “sticky” user ID are described. A system receives first input audio data and determines first speech processing results therefrom. The system also determines a first user ID of a user that spoke an utterance represented in the first input audio data and associates the first user ID with a device, which originated the first input audio data, for a predetermined length of time. The system determines first output data responsive to the first speech processing data and causes the device to present first output content corresponding thereto. The system then receives second input audio data and determines second speech processing results therefrom. The system also determines a time of receipt of the second input audio data is within the predetermined length of time. Based at least in part thereon, the system determined second output data responsive to the second speech processing data using the first user ID.Type: GrantFiled: June 23, 2020Date of Patent: April 11, 2023Assignee: Amazon Technologies, Inc.Inventor: Yu Bao
-
Patent number: 11583239Abstract: A new chest X-ray database, referred to as “ChestX-ray8”, is disclosed herein, which comprises over 100,000 frontal view X-ray images of over 32,000 unique patients with the text-mined eight disease image labels (where each image can have multi-labels), from the associated radiological reports using natural language processing. We demonstrate that these commonly occurring thoracic diseases can be detected and spatially-located via a unified weakly supervised multi-label image classification and disease localization framework, which is validated using our disclosed dataset.Type: GrantFiled: March 26, 2018Date of Patent: February 21, 2023Assignee: The United States of America, as represented by the Secretary, Department of Health and Human ServiceInventors: Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Ronald M. Summers
-
Patent number: 11574008Abstract: Methods and apparatus for audio identification during a performance are disclosed herein. An example apparatus includes at least one memory and at least one processor to transform a segment of audio into a log-frequency spectrogram based on a constant Q transform using a logarithmic frequency resolution, transform the log-frequency spectrogram into a binary image, each pixel of the binary image corresponding to a time frame and frequency channel pair, each frequency channel representing a corresponding quarter tone frequency channel in a range from C3-C8, generate a matrix product of the binary image and a plurality of reference fingerprints, normalize the matrix product to form a similarity matrix, select an alignment of a line in the similarity matrix that intersects one or more bins in the similarity matrix with the largest calculated Hamming similarities, and select a reference fingerprint based on the alignment.Type: GrantFiled: November 23, 2020Date of Patent: February 7, 2023Assignee: Gracenote, Inc.Inventors: Dale T. Roberts, Bob Coover, Nicola Marcantonio, Markus K. Cremer
-
Patent number: 11570504Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.Type: GrantFiled: November 6, 2020Date of Patent: January 31, 2023Assignee: Universal Electronics Inc.Inventors: Thomas Hascher, Menno Koopmans
-
Patent number: 11556308Abstract: An information processing system includes: an image display apparatus provided in a space and configured to display an image; a sensor apparatus carried by a user who is present in the space and configured to output a signal for detecting position information of the user in the space; and an information processing apparatus. The information processing apparatus includes circuitry configured to store a plurality of pieces of position information of a plurality of users including the user, who are in present in the space, in association with the plurality of users, the plurality of users being detected based on signals output from a plurality of sensor apparatuses including the sensor apparatus, and control environment effect production that supports communication between the plurality of users by the image displayed by the image display apparatus, based on each of the plurality of pieces of position information of the plurality of users.Type: GrantFiled: February 12, 2021Date of Patent: January 17, 2023Assignee: RICOH COMPANY, LTD.Inventor: Haruki Murata
-
Patent number: 11532300Abstract: A device with a microphone acquires audio data of a user's speech. A neural network accepts audio data as input and provides sentiment data as output. The neural network is trained using training data based on input from raters who provide votes as to which sentiment descriptors they think are associated with a sample of speech. A vote by a rater assessing the sample for a particular semantic descriptor is distributed to a plurality of semantically similar semantic descriptors. Semantic descriptor similarity data indicates relative similarity between possible semantic descriptors in the semantic space. The distributed partial votes may be aggregated to produce training data comprising samples of speech and weights of corresponding semantic descriptors. The training data is then used to train the neural network. For example, the neural network may be trained with the training data using per-instance cosine similarity loss or correlational loss.Type: GrantFiled: June 26, 2020Date of Patent: December 20, 2022Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Daniel Kenneth Bone, Viktor Rozgic, Chao Wang
-
Patent number: 11517209Abstract: Pressure sensing guidewire assemblies are described herein where the guidewire assembly may be comprised of an elongate guidewire body and multiple pressure sensors secured near or at a distal end of the guidewire body. The signals obtained from the guidewire connectors and aortic sensor modules may be synchronized to minimize signal acquisition delays. The signals may be further processed to equalize the pressure waveforms by shifting the connector waveform to align correctly with the aortic module waveform and improve output signals.Type: GrantFiled: January 9, 2019Date of Patent: December 6, 2022Assignee: PATHWAYS MEDICAL CORPORATIONInventors: Goutam Dutta, Nitin Patil
-
Patent number: 11517254Abstract: A method and system for detecting errors when practicing fluency shaping exercises. The method includes setting each threshold of a set of thresholds to a respective predetermined initial value; analyzing a voice production to compute a set of first energy levels composing the voice production, wherein the voice production is of a user practicing a fluency shaping exercise; detecting at least one speech-related error based on the computed set of first energy levels, a set of second energy levels, and the set of thresholds, wherein the detection of the at least one speech-related error is with respect to the fluency shaping exercise being practiced by the user, wherein the set of second energy levels is determined based on a calibration process; and generating feedback indicating the detected at least one speech-related error.Type: GrantFiled: January 18, 2019Date of Patent: December 6, 2022Assignee: Novotalk, Ltd.Inventors: Moshe Rot, Lilach Rothschild, Smadar Lerner
-
Patent number: 11514926Abstract: A system configured to enable a Wi-Fi processor to enter a low power mode (LPM) for short periods of time without compromising functionality is provided. A device reduces power consumption by enabling the Wi-Fi processor to enter LPM with scheduled wakeup events to enable specific functionality. In some examples, the Wi-Fi processor toggles between LPM and an active mode based on a first duty cycle to enable new device provisioning. The first duty cycle corresponds to a time required to scan a plurality of wireless channels, waking the Wi-Fi processor at a first frequency to monitor for incoming probe requests. In other examples, the Wi-Fi processor uses a second duty cycle chosen to maintain time synchronicity between a time master device and time follower devices. The device sets the second duty cycle to wake the Wi-Fi processor at a second frequency to exchange data packets with synchronized devices.Type: GrantFiled: November 6, 2020Date of Patent: November 29, 2022Assignee: Amazon Technologies, Inc.Inventors: Dibyendu Nandy, Om Prakash Gangwal
-
Patent number: 11412171Abstract: Existence of instrumentation for automatic video recording creates an excess capacity of video recording for those who own automatic video recorders. Others may want to utilize this excess capacity to record their activities thus there is a need for a system that helps match those who would like to utilize the excess capacity with those who have such capacity. Such excess capacity is matched with demand to use such excess capacity by creating a network of automatic video recording units and tags that are associated with people who want to be recorded.Type: GrantFiled: February 16, 2021Date of Patent: August 9, 2022Assignee: H4 Engineering, Inc.Inventors: Christopher T. Boyle, Konstantin Othmer, Gordon Jason Glover, Alexander G. Sammons
-
Patent number: 11400601Abstract: The present invention allows a robot to carry out communication with excellent affectiveness. A speech and behavior control device (1) includes an utterance content selecting section (16) which selects utterance content of a robot (100) from among a plurality of utterances, a movement control section (17) which controls a movable part (13) to move based on a kind of feeling corresponding to the utterance content, and an audio control section (18) which controls the robot (100) to output the utterance content as audio after movement of the movable part (13) has been started.Type: GrantFiled: December 27, 2017Date of Patent: August 2, 2022Assignee: SHARP KABUSHIKI KAISHAInventor: Takuya Oyaizu
-
Patent number: 8994522Abstract: The described method and system provide for HMI steering for a telematics-equipped vehicle based on likelihood to exceed eye glance guidelines. By determining whether a task is likely to cause the user to exceed eye glance guidelines, alternative HMI processes may be presented to a user to reduce ASGT and EORT and increase compliance with eye glance guidelines. By allowing a user to navigate through long lists of items through vocal input, T9 text input, or heuristic processing rather than through conventional presentation of the full list, a user is much more likely to comply with the eye glance guidelines. This invention is particularly useful in contexts where users may be searching for one item out of a plurality of potential items, for example, within the context of hands-free calling contacts, playing back audio files, or finding points of interest during GPS navigation.Type: GrantFiled: May 26, 2011Date of Patent: March 31, 2015Assignees: General Motors LLC, GM Global Technology Operations LLCInventors: Steven C. Tengler, Bijaya Aryal, Scott P. Geisler, Michael A. Wuergler
-
Publication number: 20140253458Abstract: A method is provided for managing phrase completion suggestions in response to text input. The method includes receiving text entered into the computing system, and identifying a first plurality of phrases that each begins with the received text and that each includes a respective phrase segment immediately following the received text. The method further includes displaying a first list of the respective phrase segments of the identified first plurality of phrases without displaying the received text, and receiving input defining a selection of one of the respective phrase segments of the displayed first list.Type: ApplicationFiled: July 20, 2011Publication date: September 11, 2014Applicant: GOOGLE INC.Inventor: Nirmal J. Patel
-
Patent number: 8731715Abstract: A mobile device moves by calculating a distance between a sound source and the mobile device using a sound source direction estimation technique. The mobile device moves by a reference distance in a direction perpendicular to a direction in which the mobile device faces the sound source when call sound of the sound source is generated, outputs voice to instruct to the sound source to generate recall sound, checks a directional angle of the mobile device when recall sound is generated by the sound source, calculates the distance between the sound source and the mobile device according to the reference distance and the directional angle of the mobile device, and moves to the vicinity of the sound source.Type: GrantFiled: November 24, 2010Date of Patent: May 20, 2014Assignee: Samsung Electronics Co., Ltd.Inventors: Won Jun Ko, Yong Jae Kim, Woo Sup Han, Ki Cheol Park
-
Publication number: 20140122083Abstract: A chatbot system and method with contextual input/output messages. A chatbot includes a processor, an interactive dialog interface and a knowledge database. The system uses a script file to display input and output messages in a tree format. An initial input or output message is stored. An identifier is assigned to the initial input or output message that is then used as context for the subsequent input/output messages by associating and storing the identifier with the subsequent input/output messages. The relationship between the first input or output message and subsequent input/output messages define a parent-child relationship that is displayable via the script file.Type: ApplicationFiled: October 26, 2012Publication date: May 1, 2014Inventor: Duan Xiaojiang
-
Publication number: 20140108016Abstract: A graphical sketch can be received, the sketch including one or more representations of text. A query can be automatically generated from the sketch. The generation of the query can include automatically recognizing the text and automatically representing the text in the query. The query can be run to identify a picture in response to the query, with the text describing one or more non-textual features of the picture. The picture can be returned, such as in response to the receipt of the graphical sketch.Type: ApplicationFiled: October 15, 2012Publication date: April 17, 2014Applicant: MICROSOFT CORPORATIONInventor: Brian Albrecht
-
Publication number: 20140086395Abstract: In an embodiment, a system maintains a database of a plurality of persons. The database includes an audio clip of a pronunciation of a name of a first person in the database. The system determines from a calendar database that a second person has an event in common with the first person, and transmits to a device associated with the second person an indication that the database includes the pronunciation of the name of the first person.Type: ApplicationFiled: September 25, 2012Publication date: March 27, 2014Applicant: Linkedln CorporationInventors: Jonathan Redfern, Manish Mohan Sharma, Seth McLaughlin
-
Publication number: 20140074480Abstract: In-vehicle functions are implemented using a plurality of microphones disposed in a vehicle. Each of the microphones is disposed in a portion of the vehicle defined by a zone. The in-vehicle functions are also implemented via a central controller of the vehicle. The central controller includes a computer processor executing logic. The logic receive a voice communication from an individual via one of the microphones, identifies the zone in the vehicle occupied by the individual, identifies the individual by comparing a voice stamp from the voice communication to a database of voice stamps, and implements at least one vehicle electronic component in the zone based on user preferences associated with the voice stamp.Type: ApplicationFiled: September 11, 2012Publication date: March 13, 2014Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLCInventors: Jesse T. Gratke, Bassam S. Shahmurad
-
Publication number: 20140046668Abstract: A control method for a video-audio playing system receiving a video-audio streaming signal is provided. The video-audio streaming signal includes at least a channel-program information. The control method comprises receiving a speech signal and analyzing the speech signal to obtain an acoustic feature of the speech signal. According to the acoustic feature, a speech recognition is performed to determine one of the channel-program information corresponds to the acoustic feature. According to the determined channel-program information, the video-audio playing system executes an operation corresponding to the channel-program information.Type: ApplicationFiled: September 10, 2012Publication date: February 13, 2014Applicant: WISTRON CORPORATIONInventor: Chih-Wen Huang
-
Publication number: 20140032218Abstract: Dynamic adjustment of text input system components is provided. An indication of user activity with respect to a text input system of an electronic device is received. One or more activity indicators are determined based on at least the user activity. One or more components of the text input system are identified, each component providing a typing assistance functionality to a user and being associated with a set of parameters. For each of the one or more components, a determination is made whether the component should be adjusted based on the one or more activity indicators, and the component is dynamically adjusted when it is determined that the component should be adjusted based on the one or more activity indicators. Dynamically adjusting the component includes at least one of activating the component, deactivating the component or adjusting the set of parameters associated with the component.Type: ApplicationFiled: July 30, 2012Publication date: January 30, 2014Applicant: GOOGLE INC.Inventor: Bryan Russell Yeung
-
Publication number: 20140032222Abstract: A medical error alert device may comprise a controller; a first memory, a recording and playback module and a user interface. The user interface may be configured to enable a patient or a patient representative to record an announcement identifying at least a medical procedure to be carried out. The user interface may be further configured to enable later playback of the announcement before the medical procedure is carried out. A communication device may be provided, coupled to a network to enable reception of signals from the network comprising at least predetermined patient identification number and/or a unique medical alert device identifier. A predetermined alert may be generated responsive to the communication device receiving a signal associated with the predetermined alert and the patient identification number and/or the unique device identifier.Type: ApplicationFiled: August 29, 2012Publication date: January 30, 2014Applicant: TransMed 7, LLCInventors: Sally J. VETTER, Heather L. YOUNG, James W. VETTER
-
Publication number: 20140032220Abstract: A dialog system is accessed by a remote user and is typically configured to receive a natural language query from the user and return a natural language answer to the user. Dialog systems can be copied without authorization or can become an out-of-date version. A dialog system with a signature, referred to herein as a “signed” dialog system, can indicate the signature without affecting usage by users who are unaware that the dialog system contains the signature. The signed dialog system can respond to input such that only the designer of the dialog system knows the signature is embedded in the dialog system. The response is a way to check the source or other characteristics of the dialog system. A designer of signed dialog systems can prove whether an unauthorized copy of the signed dialog system is used by a third party by using publically-available user interfaces.Type: ApplicationFiled: July 27, 2012Publication date: January 30, 2014Inventor: Solomon Z. Lerner
-
Publication number: 20140032223Abstract: The embodiments disclosed herein relate to a system and method for processing a prescription through voice-activated commands. The system and method efficiently and effectively process the prescription so that a pharmacy may handle the increasing prescription processing demands.Type: ApplicationFiled: July 27, 2012Publication date: January 30, 2014Inventor: Roderick Powe
-
Publication number: 20140032221Abstract: A medical error alert device may comprise a controller; a first memory, a recording and playback module and a user interface. The user interface may be configured to enable a patient or a patient representative to record an announcement identifying at least a medical procedure to be carried out. The user interface may be further configured to enable later playback of the announcement before the medical procedure is carried out. A communication device may be provided, coupled to a network to enable reception of signals from the network comprising at least predetermined patient identification number and/or a unique medical alert device identifier. A predetermined alert may be generated responsive to the communication device receiving a signal associated with the predetermined alert and the patient identification number and/or the unique device identifier.Type: ApplicationFiled: July 28, 2012Publication date: January 30, 2014Applicant: TransMed 7, LLCInventors: Sally J. VETTER, Heather L. Young, James W. Vetter
-
Publication number: 20140032219Abstract: In one embodiment, a method comprises classifying a representation of audio data of a dialog turn in a dialog system to a classification. The method may further comprise taking a security action on the classified representation of the audio data of the dialog turn as a function of the classification. The security action can be suppressing the representation of the audio data, encrypting the representation of the audio data, releasing the representation of the audio data, partially suppressing the representation of the audio data, partially encrypting the representation of the audio data, partially releasing the representation of the audio data, or a command.Type: ApplicationFiled: July 27, 2012Publication date: January 30, 2014Inventors: Solomon Z. Lerner, Mark Fanty