Miscellaneous Analysis Or Detection Of Speech Characteristics (epo) Patents (Class 704/E11.001)
  • Patent number: 11907666
    Abstract: Various embodiments of a system and associated method for anonymization of text without losing semantic utility of text by extracting a latent embedding representation of content with respect to a given task and by learning an optimal strategy for text embedding manipulation to satisfy both privacy and utility requirements are disclosed herein. In particular, the system balances private attribute obfuscation with retained semantic utility.
    Type: Grant
    Filed: November 16, 2021
    Date of Patent: February 20, 2024
    Assignee: Arizona Board of Regents on Behalf of Arizona State University
    Inventors: Ahmadreza Mosallanezhad, Ghazaleh Beigi, Huan Liu
  • Patent number: 11887590
    Abstract: Methods and devices for enabling and disabling applications using voice are described herein. In some embodiments, an individual speak an utterance to their electronic device, which may send audio data representing the utterance to a backend system. The backend system may generate text data representing the utterance, and may determine that an intent of the utterance was for an application to be enabled or disabled for their user account on the backend system. If, for instance, the intent was to enable the application, the backend system may receive one or more rules for performing functionalities of the application, as well as one or more sample templates of sample utterances and sample responses that future utterances may use when requesting the application. Furthermore, one or more invocation phrases that may be used within the future utterances to invoke the application may be received, along with slot values for the sample templates.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: January 30, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Shaman D'Souza, Ian Suttle, Srikanth Nori, Rajiv Reddy, Amol Kanitkar, Tina Orooji
  • Patent number: 11889142
    Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.
    Type: Grant
    Filed: December 9, 2022
    Date of Patent: January 30, 2024
    Assignee: Universal Electronics Inc.
    Inventors: Thomas Hascher, Menno Koopmans
  • Patent number: 11830380
    Abstract: Methods, systems and computer program products for automated learning are provided herein. A computer-implemented method includes authenticating a plurality of users for an automated learning session, wherein the plurality of users correspond to at least one device, and providing the automated learning session for the plurality of users. Providing the automated learning session comprises analyzing a plurality of learning models corresponding to one or more of the plurality of users, determining, based on the analysis, one or more activities to be performed by the plurality of users during the automated learning session, and executing the one or more activities on at least one device.
    Type: Grant
    Filed: January 10, 2019
    Date of Patent: November 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Smitkumar Narotambhai Marvaniya, Tejas Indulal Dhamecha, Malolan Chetlur, Renuka Sindhgatta, Bikram Sengupta
  • Patent number: 11798579
    Abstract: A parameter included in a fundamental frequency pattern of a voice can be estimated from the fundamental frequency pattern with high accuracy and the fundamental frequency pattern of the voice can be reconstructed from the parameter included in the fundamental frequency pattern. A learning unit 30 learns a deep generation model including an encoder which regards a parameter included in a fundamental frequency pattern in a voice signal as a latent variable of the deep generation model and estimates the latent variable from the fundamental frequency pattern in the voice signal on the basis of parallel data of the fundamental frequency pattern in the voice signal and the parameter included in the fundamental frequency pattern in the voice signal, and a decoder which reconstructs the fundamental frequency pattern in the voice signal from the latent variable.
    Type: Grant
    Filed: February 19, 2019
    Date of Patent: October 24, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Ko Tanaka, Hirokazu Kameoka
  • Patent number: 11755847
    Abstract: Embodiments described herein provide adversarial attacks targeting the cross-lingual generalization ability of massive multilingual representations, demonstrating their effectiveness on multilingual models for natural language inference and question answering. An efficient adversarial training scheme can thus be implemented with the adversarial attacks, which takes the same number of steps as standard supervised training and show that it encourages language-invariance in representations, thereby improving both clean and robust accuracy.
    Type: Grant
    Filed: January 15, 2021
    Date of Patent: September 12, 2023
    Assignee: Salesforce, Inc.
    Inventors: Samson Min Rong Tan, Shafiq Rayhan Joty
  • Patent number: 11721328
    Abstract: The present invention discloses a method and apparatus for awakening skills by speech, which are applied to an electronic device. The method for awakening skills by speech includes: recognizing awakening text information corresponding to a speech request message to be processed; invoking a service skill semantic model to determine a target service field corresponding to the awakening text information and a corresponding first confidence, and invoking a knowledge skill semantic model to determine a knowledge reply answer corresponding to the awakening text information and a corresponding second confidence; and selecting to awaken one of a knowledge skill and a target service skill corresponding to the target service field based on the first confidence and the second confidence. Accordingly, the probability of erroneously awakening a skill based on the speech message can be reduced.
    Type: Grant
    Filed: October 26, 2020
    Date of Patent: August 8, 2023
    Assignee: AI SPEECH CO., LTD.
    Inventor: Chengya Zhu
  • Patent number: 11663415
    Abstract: The following relates generally to voice assisted healthcare. In some embodiments, a digital assistant receives audio data, and determines an intent from the audio data. The digital assistant may then match the determined intent to a flow of a set of flows, where the set of flows may include at least one of: (i) submitting a prescription, (ii) refilling a prescription, (iii) changing a pickup location, (iv) requesting a status update for a prescription, or (v) initiating a pharmacy chat session. The matched flow of the set of flows may then be executed.
    Type: Grant
    Filed: August 31, 2020
    Date of Patent: May 30, 2023
    Assignee: WALGREEN CO.
    Inventors: Julija Alegra Petkus, Andrew David Schweinfurth, Stephen Elijah Zambo
  • Patent number: 11638086
    Abstract: A method and an apparatus for enabling adaptive audio signal alteration are described. When an input audio signal is received, a determination of whether the user of an audio device hears the input audio signal is performed based upon brain activity of the user. A determination of whether the user is distracted by the audio signal is performed based upon sensor measurements indicating a physical state of the user. In response to determining that the user hears the input audio signal and that the input audio signal causes the user to be distracted, a determination of configuration parameter(s) is performed. An alteration of audio signal(s) is caused based upon the configuration parameter(s) to obtain modified version(s) of the audio signal(s) that are intended to address the distraction caused by the input audio signal, and output audio signals are output, where the output audio signals include the modified versions.
    Type: Grant
    Filed: June 29, 2022
    Date of Patent: April 25, 2023
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Matthew John Lawrenson, Jan Jasper Van Den Berg, Jacob Ström, Lars Andersson
  • Patent number: 11627189
    Abstract: Techniques for implementing a “sticky” user ID are described. A system receives first input audio data and determines first speech processing results therefrom. The system also determines a first user ID of a user that spoke an utterance represented in the first input audio data and associates the first user ID with a device, which originated the first input audio data, for a predetermined length of time. The system determines first output data responsive to the first speech processing data and causes the device to present first output content corresponding thereto. The system then receives second input audio data and determines second speech processing results therefrom. The system also determines a time of receipt of the second input audio data is within the predetermined length of time. Based at least in part thereon, the system determined second output data responsive to the second speech processing data using the first user ID.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: April 11, 2023
    Assignee: Amazon Technologies, Inc.
    Inventor: Yu Bao
  • Patent number: 11583239
    Abstract: A new chest X-ray database, referred to as “ChestX-ray8”, is disclosed herein, which comprises over 100,000 frontal view X-ray images of over 32,000 unique patients with the text-mined eight disease image labels (where each image can have multi-labels), from the associated radiological reports using natural language processing. We demonstrate that these commonly occurring thoracic diseases can be detected and spatially-located via a unified weakly supervised multi-label image classification and disease localization framework, which is validated using our disclosed dataset.
    Type: Grant
    Filed: March 26, 2018
    Date of Patent: February 21, 2023
    Assignee: The United States of America, as represented by the Secretary, Department of Health and Human Service
    Inventors: Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Ronald M. Summers
  • Patent number: 11574008
    Abstract: Methods and apparatus for audio identification during a performance are disclosed herein. An example apparatus includes at least one memory and at least one processor to transform a segment of audio into a log-frequency spectrogram based on a constant Q transform using a logarithmic frequency resolution, transform the log-frequency spectrogram into a binary image, each pixel of the binary image corresponding to a time frame and frequency channel pair, each frequency channel representing a corresponding quarter tone frequency channel in a range from C3-C8, generate a matrix product of the binary image and a plurality of reference fingerprints, normalize the matrix product to form a similarity matrix, select an alignment of a line in the similarity matrix that intersects one or more bins in the similarity matrix with the largest calculated Hamming similarities, and select a reference fingerprint based on the alignment.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: February 7, 2023
    Assignee: Gracenote, Inc.
    Inventors: Dale T. Roberts, Bob Coover, Nicola Marcantonio, Markus K. Cremer
  • Patent number: 11570504
    Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.
    Type: Grant
    Filed: November 6, 2020
    Date of Patent: January 31, 2023
    Assignee: Universal Electronics Inc.
    Inventors: Thomas Hascher, Menno Koopmans
  • Patent number: 11556308
    Abstract: An information processing system includes: an image display apparatus provided in a space and configured to display an image; a sensor apparatus carried by a user who is present in the space and configured to output a signal for detecting position information of the user in the space; and an information processing apparatus. The information processing apparatus includes circuitry configured to store a plurality of pieces of position information of a plurality of users including the user, who are in present in the space, in association with the plurality of users, the plurality of users being detected based on signals output from a plurality of sensor apparatuses including the sensor apparatus, and control environment effect production that supports communication between the plurality of users by the image displayed by the image display apparatus, based on each of the plurality of pieces of position information of the plurality of users.
    Type: Grant
    Filed: February 12, 2021
    Date of Patent: January 17, 2023
    Assignee: RICOH COMPANY, LTD.
    Inventor: Haruki Murata
  • Patent number: 11532300
    Abstract: A device with a microphone acquires audio data of a user's speech. A neural network accepts audio data as input and provides sentiment data as output. The neural network is trained using training data based on input from raters who provide votes as to which sentiment descriptors they think are associated with a sample of speech. A vote by a rater assessing the sample for a particular semantic descriptor is distributed to a plurality of semantically similar semantic descriptors. Semantic descriptor similarity data indicates relative similarity between possible semantic descriptors in the semantic space. The distributed partial votes may be aggregated to produce training data comprising samples of speech and weights of corresponding semantic descriptors. The training data is then used to train the neural network. For example, the neural network may be trained with the training data using per-instance cosine similarity loss or correlational loss.
    Type: Grant
    Filed: June 26, 2020
    Date of Patent: December 20, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Daniel Kenneth Bone, Viktor Rozgic, Chao Wang
  • Patent number: 11517254
    Abstract: A method and system for detecting errors when practicing fluency shaping exercises. The method includes setting each threshold of a set of thresholds to a respective predetermined initial value; analyzing a voice production to compute a set of first energy levels composing the voice production, wherein the voice production is of a user practicing a fluency shaping exercise; detecting at least one speech-related error based on the computed set of first energy levels, a set of second energy levels, and the set of thresholds, wherein the detection of the at least one speech-related error is with respect to the fluency shaping exercise being practiced by the user, wherein the set of second energy levels is determined based on a calibration process; and generating feedback indicating the detected at least one speech-related error.
    Type: Grant
    Filed: January 18, 2019
    Date of Patent: December 6, 2022
    Assignee: Novotalk, Ltd.
    Inventors: Moshe Rot, Lilach Rothschild, Smadar Lerner
  • Patent number: 11517209
    Abstract: Pressure sensing guidewire assemblies are described herein where the guidewire assembly may be comprised of an elongate guidewire body and multiple pressure sensors secured near or at a distal end of the guidewire body. The signals obtained from the guidewire connectors and aortic sensor modules may be synchronized to minimize signal acquisition delays. The signals may be further processed to equalize the pressure waveforms by shifting the connector waveform to align correctly with the aortic module waveform and improve output signals.
    Type: Grant
    Filed: January 9, 2019
    Date of Patent: December 6, 2022
    Assignee: PATHWAYS MEDICAL CORPORATION
    Inventors: Goutam Dutta, Nitin Patil
  • Patent number: 11514926
    Abstract: A system configured to enable a Wi-Fi processor to enter a low power mode (LPM) for short periods of time without compromising functionality is provided. A device reduces power consumption by enabling the Wi-Fi processor to enter LPM with scheduled wakeup events to enable specific functionality. In some examples, the Wi-Fi processor toggles between LPM and an active mode based on a first duty cycle to enable new device provisioning. The first duty cycle corresponds to a time required to scan a plurality of wireless channels, waking the Wi-Fi processor at a first frequency to monitor for incoming probe requests. In other examples, the Wi-Fi processor uses a second duty cycle chosen to maintain time synchronicity between a time master device and time follower devices. The device sets the second duty cycle to wake the Wi-Fi processor at a second frequency to exchange data packets with synchronized devices.
    Type: Grant
    Filed: November 6, 2020
    Date of Patent: November 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Dibyendu Nandy, Om Prakash Gangwal
  • Patent number: 11412171
    Abstract: Existence of instrumentation for automatic video recording creates an excess capacity of video recording for those who own automatic video recorders. Others may want to utilize this excess capacity to record their activities thus there is a need for a system that helps match those who would like to utilize the excess capacity with those who have such capacity. Such excess capacity is matched with demand to use such excess capacity by creating a network of automatic video recording units and tags that are associated with people who want to be recorded.
    Type: Grant
    Filed: February 16, 2021
    Date of Patent: August 9, 2022
    Assignee: H4 Engineering, Inc.
    Inventors: Christopher T. Boyle, Konstantin Othmer, Gordon Jason Glover, Alexander G. Sammons
  • Patent number: 11400601
    Abstract: The present invention allows a robot to carry out communication with excellent affectiveness. A speech and behavior control device (1) includes an utterance content selecting section (16) which selects utterance content of a robot (100) from among a plurality of utterances, a movement control section (17) which controls a movable part (13) to move based on a kind of feeling corresponding to the utterance content, and an audio control section (18) which controls the robot (100) to output the utterance content as audio after movement of the movable part (13) has been started.
    Type: Grant
    Filed: December 27, 2017
    Date of Patent: August 2, 2022
    Assignee: SHARP KABUSHIKI KAISHA
    Inventor: Takuya Oyaizu
  • Patent number: 8994522
    Abstract: The described method and system provide for HMI steering for a telematics-equipped vehicle based on likelihood to exceed eye glance guidelines. By determining whether a task is likely to cause the user to exceed eye glance guidelines, alternative HMI processes may be presented to a user to reduce ASGT and EORT and increase compliance with eye glance guidelines. By allowing a user to navigate through long lists of items through vocal input, T9 text input, or heuristic processing rather than through conventional presentation of the full list, a user is much more likely to comply with the eye glance guidelines. This invention is particularly useful in contexts where users may be searching for one item out of a plurality of potential items, for example, within the context of hands-free calling contacts, playing back audio files, or finding points of interest during GPS navigation.
    Type: Grant
    Filed: May 26, 2011
    Date of Patent: March 31, 2015
    Assignees: General Motors LLC, GM Global Technology Operations LLC
    Inventors: Steven C. Tengler, Bijaya Aryal, Scott P. Geisler, Michael A. Wuergler
  • Publication number: 20140253458
    Abstract: A method is provided for managing phrase completion suggestions in response to text input. The method includes receiving text entered into the computing system, and identifying a first plurality of phrases that each begins with the received text and that each includes a respective phrase segment immediately following the received text. The method further includes displaying a first list of the respective phrase segments of the identified first plurality of phrases without displaying the received text, and receiving input defining a selection of one of the respective phrase segments of the displayed first list.
    Type: Application
    Filed: July 20, 2011
    Publication date: September 11, 2014
    Applicant: GOOGLE INC.
    Inventor: Nirmal J. Patel
  • Patent number: 8731715
    Abstract: A mobile device moves by calculating a distance between a sound source and the mobile device using a sound source direction estimation technique. The mobile device moves by a reference distance in a direction perpendicular to a direction in which the mobile device faces the sound source when call sound of the sound source is generated, outputs voice to instruct to the sound source to generate recall sound, checks a directional angle of the mobile device when recall sound is generated by the sound source, calculates the distance between the sound source and the mobile device according to the reference distance and the directional angle of the mobile device, and moves to the vicinity of the sound source.
    Type: Grant
    Filed: November 24, 2010
    Date of Patent: May 20, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Won Jun Ko, Yong Jae Kim, Woo Sup Han, Ki Cheol Park
  • Publication number: 20140122083
    Abstract: A chatbot system and method with contextual input/output messages. A chatbot includes a processor, an interactive dialog interface and a knowledge database. The system uses a script file to display input and output messages in a tree format. An initial input or output message is stored. An identifier is assigned to the initial input or output message that is then used as context for the subsequent input/output messages by associating and storing the identifier with the subsequent input/output messages. The relationship between the first input or output message and subsequent input/output messages define a parent-child relationship that is displayable via the script file.
    Type: Application
    Filed: October 26, 2012
    Publication date: May 1, 2014
    Inventor: Duan Xiaojiang
  • Publication number: 20140108016
    Abstract: A graphical sketch can be received, the sketch including one or more representations of text. A query can be automatically generated from the sketch. The generation of the query can include automatically recognizing the text and automatically representing the text in the query. The query can be run to identify a picture in response to the query, with the text describing one or more non-textual features of the picture. The picture can be returned, such as in response to the receipt of the graphical sketch.
    Type: Application
    Filed: October 15, 2012
    Publication date: April 17, 2014
    Applicant: MICROSOFT CORPORATION
    Inventor: Brian Albrecht
  • Publication number: 20140086395
    Abstract: In an embodiment, a system maintains a database of a plurality of persons. The database includes an audio clip of a pronunciation of a name of a first person in the database. The system determines from a calendar database that a second person has an event in common with the first person, and transmits to a device associated with the second person an indication that the database includes the pronunciation of the name of the first person.
    Type: Application
    Filed: September 25, 2012
    Publication date: March 27, 2014
    Applicant: Linkedln Corporation
    Inventors: Jonathan Redfern, Manish Mohan Sharma, Seth McLaughlin
  • Publication number: 20140074480
    Abstract: In-vehicle functions are implemented using a plurality of microphones disposed in a vehicle. Each of the microphones is disposed in a portion of the vehicle defined by a zone. The in-vehicle functions are also implemented via a central controller of the vehicle. The central controller includes a computer processor executing logic. The logic receive a voice communication from an individual via one of the microphones, identifies the zone in the vehicle occupied by the individual, identifies the individual by comparing a voice stamp from the voice communication to a database of voice stamps, and implements at least one vehicle electronic component in the zone based on user preferences associated with the voice stamp.
    Type: Application
    Filed: September 11, 2012
    Publication date: March 13, 2014
    Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC
    Inventors: Jesse T. Gratke, Bassam S. Shahmurad
  • Publication number: 20140046668
    Abstract: A control method for a video-audio playing system receiving a video-audio streaming signal is provided. The video-audio streaming signal includes at least a channel-program information. The control method comprises receiving a speech signal and analyzing the speech signal to obtain an acoustic feature of the speech signal. According to the acoustic feature, a speech recognition is performed to determine one of the channel-program information corresponds to the acoustic feature. According to the determined channel-program information, the video-audio playing system executes an operation corresponding to the channel-program information.
    Type: Application
    Filed: September 10, 2012
    Publication date: February 13, 2014
    Applicant: WISTRON CORPORATION
    Inventor: Chih-Wen Huang
  • Publication number: 20140032221
    Abstract: A medical error alert device may comprise a controller; a first memory, a recording and playback module and a user interface. The user interface may be configured to enable a patient or a patient representative to record an announcement identifying at least a medical procedure to be carried out. The user interface may be further configured to enable later playback of the announcement before the medical procedure is carried out. A communication device may be provided, coupled to a network to enable reception of signals from the network comprising at least predetermined patient identification number and/or a unique medical alert device identifier. A predetermined alert may be generated responsive to the communication device receiving a signal associated with the predetermined alert and the patient identification number and/or the unique device identifier.
    Type: Application
    Filed: July 28, 2012
    Publication date: January 30, 2014
    Applicant: TransMed 7, LLC
    Inventors: Sally J. VETTER, Heather L. Young, James W. Vetter
  • Publication number: 20140032223
    Abstract: The embodiments disclosed herein relate to a system and method for processing a prescription through voice-activated commands. The system and method efficiently and effectively process the prescription so that a pharmacy may handle the increasing prescription processing demands.
    Type: Application
    Filed: July 27, 2012
    Publication date: January 30, 2014
    Inventor: Roderick Powe
  • Publication number: 20140032218
    Abstract: Dynamic adjustment of text input system components is provided. An indication of user activity with respect to a text input system of an electronic device is received. One or more activity indicators are determined based on at least the user activity. One or more components of the text input system are identified, each component providing a typing assistance functionality to a user and being associated with a set of parameters. For each of the one or more components, a determination is made whether the component should be adjusted based on the one or more activity indicators, and the component is dynamically adjusted when it is determined that the component should be adjusted based on the one or more activity indicators. Dynamically adjusting the component includes at least one of activating the component, deactivating the component or adjusting the set of parameters associated with the component.
    Type: Application
    Filed: July 30, 2012
    Publication date: January 30, 2014
    Applicant: GOOGLE INC.
    Inventor: Bryan Russell Yeung
  • Publication number: 20140032222
    Abstract: A medical error alert device may comprise a controller; a first memory, a recording and playback module and a user interface. The user interface may be configured to enable a patient or a patient representative to record an announcement identifying at least a medical procedure to be carried out. The user interface may be further configured to enable later playback of the announcement before the medical procedure is carried out. A communication device may be provided, coupled to a network to enable reception of signals from the network comprising at least predetermined patient identification number and/or a unique medical alert device identifier. A predetermined alert may be generated responsive to the communication device receiving a signal associated with the predetermined alert and the patient identification number and/or the unique device identifier.
    Type: Application
    Filed: August 29, 2012
    Publication date: January 30, 2014
    Applicant: TransMed 7, LLC
    Inventors: Sally J. VETTER, Heather L. YOUNG, James W. VETTER
  • Publication number: 20140032219
    Abstract: In one embodiment, a method comprises classifying a representation of audio data of a dialog turn in a dialog system to a classification. The method may further comprise taking a security action on the classified representation of the audio data of the dialog turn as a function of the classification. The security action can be suppressing the representation of the audio data, encrypting the representation of the audio data, releasing the representation of the audio data, partially suppressing the representation of the audio data, partially encrypting the representation of the audio data, partially releasing the representation of the audio data, or a command.
    Type: Application
    Filed: July 27, 2012
    Publication date: January 30, 2014
    Inventors: Solomon Z. Lerner, Mark Fanty
  • Publication number: 20140032220
    Abstract: A dialog system is accessed by a remote user and is typically configured to receive a natural language query from the user and return a natural language answer to the user. Dialog systems can be copied without authorization or can become an out-of-date version. A dialog system with a signature, referred to herein as a “signed” dialog system, can indicate the signature without affecting usage by users who are unaware that the dialog system contains the signature. The signed dialog system can respond to input such that only the designer of the dialog system knows the signature is embedded in the dialog system. The response is a way to check the source or other characteristics of the dialog system. A designer of signed dialog systems can prove whether an unauthorized copy of the signed dialog system is used by a third party by using publically-available user interfaces.
    Type: Application
    Filed: July 27, 2012
    Publication date: January 30, 2014
    Inventor: Solomon Z. Lerner
  • Publication number: 20130339028
    Abstract: A voice activation system is provided. The voice activation system includes a first stage configured to output a first activation signal if at least one energy characteristic of a received audio signal satisfies at least one threshold and a second stage configured to transition from a first state to a second state in response to the first activation signal and, when in the second state, to output a second activation signal if at least a portion of a profile of the audio signal substantially matches at least one predetermined profile.
    Type: Application
    Filed: June 15, 2012
    Publication date: December 19, 2013
    Applicant: Spansion LLC
    Inventors: Stephan ROSNER, Chen Liu, Jens Olson
  • Publication number: 20130325480
    Abstract: A remote controller includes a housing, a direction sensor, a microphone, a controller, and a wireless transmitter. A control method of the remote controller includes detecting an angle between an axis of a remote controller and a vertical axis, enabling a microphone of the remote controller when the angle is within a predetermined range in order to generate a voice signal according to a voice command, and generating a first control signal according the voice signal and transmit the first control signal wirelessly.
    Type: Application
    Filed: September 13, 2012
    Publication date: December 5, 2013
    Applicant: AU OPTRONICS CORP.
    Inventors: Yin-Ting Lee, Chien-Hung Chen, Chang-Ho Shen
  • Publication number: 20130325446
    Abstract: The instant application includes computationally-implemented systems and methods that include acquiring indication of a speech-facilitated transaction between a particular party and a target device, receiving adaptation data correlated to the particular party, the receiving facilitated by a particular device associated with the particular party, processing audio data from the particular party at least partly using the received adaptation data correlated to the particular party, and updating the adaptation data based at least in part on a result of the processed audio data, such that the updated adaptation data is configured to be transmitted to the particular device. In addition to the foregoing, other aspects are described in the claims, drawings, and text.
    Type: Application
    Filed: June 29, 2012
    Publication date: December 5, 2013
    Inventors: Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud
  • Publication number: 20130325447
    Abstract: The instant application includes computationally-implemented systems and methods that include acquiring indication of a speech-facilitated transaction between a particular party and a target device, receiving adaptation data correlated to the particular party, the receiving facilitated by a particular device associated with the particular party, processing audio data from the particular party at least partly using the received adaptation data correlated to the particular party, and updating the adaptation data based at least in part on a result of the processed audio data, such that the updated adaptation data is configured to be transmitted to the particular device. In addition to the foregoing, other aspects are described in the claims, drawings, and text.
    Type: Application
    Filed: June 29, 2012
    Publication date: December 5, 2013
    Inventors: Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud
  • Patent number: 8599836
    Abstract: Disclosed is an on demand, web-based, outbound contact center utilizing Voice over IP (VoIP) and speaker-independent voice recognition which automatically captures contact responses to question events in a pre-recorded, interactive voice call, the call launched by a user via a broadcast comprising a call sequence created by the user via a call center user interface comprising event add and logic add wizards, the call sequence comprising event prompts based on a user-generated script comprising message events and question events, the event prompts in the group consisting of voice recordings and text-to-speech inputs.
    Type: Grant
    Filed: January 21, 2011
    Date of Patent: December 3, 2013
    Assignee: Neobitspeak LLC
    Inventors: Terry Lynn Van Buren, Vesna Rafaty
  • Publication number: 20130317827
    Abstract: A computer-implemented system includes one or multiple application devices and a voice-controlled storage device. Multiple voice commands may be issued to multiple application devices simultaneously or separately, or to the same application device separately. The voice-controlled storage device is configured to perform content identification and voiceprint recognition on the voice commands. Therefore, each requestor may be allowed to operate the voice-controlled storage device in a corresponding operation mode according to respective authorization level.
    Type: Application
    Filed: May 23, 2012
    Publication date: November 28, 2013
    Inventors: Tsung-Chun Fu, I-Ming Lo
  • Publication number: 20130304475
    Abstract: A method of configuring an acoustics system of a convertible vehicle to receive speech from an occupant of the vehicle who is using hand-free technology. The position of the top of the convertible is first determined and based upon whether the top is up or down, an audio reception configuration is selected. The audio reception configuration includes a set of tuning parameters and a microphone arrangement. The acoustics system is then configured based upon the determination of whether the top is up or down.
    Type: Application
    Filed: May 14, 2012
    Publication date: November 14, 2013
    Applicant: GENERAL MOTORS LLC
    Inventors: Jesse T. Gratke, Craig A. Lambert, Kurt J. Reichert
  • Publication number: 20130289994
    Abstract: Techniques disclosed herein include systems and methods that enable a voice trigger that wakes-up an electronic device or causes the device to make additional voice commands active, without manual initiation of voice command functionality. In addition, such a voice trigger is dynamically programmable or customizable. A speaker can program or designate a particular phrase as the voice trigger. In general, techniques herein execute a voice-activated wake-up system that operates on a digital signal processor (DSP) or other low-power, secondary processing unit of an electronic device instead of running on a central processing unit (CPU). A speech recognition manager runs two speech recognition systems on an electronic device. The CPU dynamically creates a compact speech system for the DSP. Such a compact system can be continuously run during a standby mode, without quickly exhausting a battery supply.
    Type: Application
    Filed: April 26, 2012
    Publication date: October 31, 2013
    Inventors: Michael Jack Newman, Robert Roth, William D. Alexander, Paul van Mulbregt
  • Publication number: 20130275139
    Abstract: A system and methodology of delivery fluids and monitoring their status which is voice actuated. This system has application where a hands-free environment is preferred. Voice commands are given by the user via a Bluetooth® headset and received typically by the user's Smartphone. Voice recognition circuitry is programmed to recognize the simple commands and through complementing electronics, and electro-mechanical and mechanical elements, delivery at corresponding flow rates is accomplished. A further feature allows for respective voice commands to initiate a monitoring function where the status of any particular characteristic of the fluid can be relayed back to the user via the headset.
    Type: Application
    Filed: May 21, 2012
    Publication date: October 17, 2013
    Inventor: DENNIS R. COLEMAN
  • Publication number: 20130262104
    Abstract: A procurement system may include a first interface configured to receive a query from a user, a command module configured to parameterize the query, an intelligent search and match engine configured to compare the parameterized query with stored queries in a historical knowledge base and, in the event the parameterized query does not match a stored query within the historical knowledge base, search for a match in a plurality of knowledge models, and a response solution engine configured to receive a system response ID from the intelligent search and match engine, the response solution engine being configured to initiate a system action by interacting with sub-system and related databases to generate a system response.
    Type: Application
    Filed: March 28, 2012
    Publication date: October 3, 2013
    Inventors: Subhash Makhija, Santosh Katakol, Dhananlay Nagalkar, Siddhaarth Iyer, Ravi Mevcha
  • Publication number: 20130218566
    Abstract: The text-to-speech audio HIP technique described herein in some embodiments uses different correlated or uncorrelated words or sentences generated via a text-to-speech engine as audio HIP challenges. The technique can apply different effects in the text-to-speech synthesizer speaking a sentence to be used as a HIP challenge string. The different effects can include, for example, spectral frequency warping; vowel duration warping; background addition; echo addition; and varying the time duration between words, among others. In some embodiments the technique varies the set of parameters to prevent using Automated Speech Recognition tools from using previously used audio HIP challenges to learn a model which can then be used to recognize future audio HIP challenges generated by the technique. Additionally, in some embodiments the technique introduces the requirement of semantic understanding in HIP challenges.
    Type: Application
    Filed: February 17, 2012
    Publication date: August 22, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Yao Qian, Frank Kao-Ping Soong, Bin Benjamin Zhu
  • Publication number: 20130204604
    Abstract: A language interpretation system receives a request for an interpretation of a voice communication between a first language and a second language. Further, the language interpretation system provides the request to a machine language interpreter. In addition, the machine language interpreter provides live language interpretation of the voice communication. The live language interpretation of the voice communication is halted by the machine language interpreter in real time during the live language interpretation based upon a criteria being met. Further, the voice communication is transitioned to a human language interpreter to resume the live language interpretation of the voice communication after the machine language interpreter is halted.
    Type: Application
    Filed: February 6, 2012
    Publication date: August 8, 2013
    Inventor: Lindsay D'Penha
  • Publication number: 20130204626
    Abstract: Methods and systems for setting selected automatic speech recognition parameters are described. A data set associated with operation of a speech recognition application is defined and includes: i. recognition states characterizing the semantic progression of a user interaction with the speech recognition application, and ii. recognition outcomes associated with each recognition state. For a selected user interaction with the speech recognition application, an application cost function is defined that characterizes an estimated cost of the user interaction for each recognition outcome. For one or more system performance parameters indirectly related to the user interaction, the parameters are set to values which optimize the cost of the user interaction over the recognition states.
    Type: Application
    Filed: February 3, 2012
    Publication date: August 8, 2013
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventor: Jeffrey N. Marcus
  • Publication number: 20130197914
    Abstract: A voice activated system for operating electronic devices in an environment includes a microphone for receiving a verbal command that requests the addition of a new voice command, a first processor, that is electrically connected to the microphone, for receiving a customized command input regarding a preexisting user for the voice activated system that should be associated with the new verbal command, input involving a new verbal command, and input involving a system command, where the first processor is then able to receive verbal input to recognize a user, a verbal command, and then determine an associated action, an appropriate command for that action and then generate an associated system command, and a second processor, in electronic communication with the first processor, and two or more electronic devices in an environment, where the second processor is capable of receiving the system command and operating the two or more devices.
    Type: Application
    Filed: January 26, 2012
    Publication date: August 1, 2013
    Applicant: MicroTechnologies LLC d/b/a MicroTech
    Inventors: Timothy Yelvington, Edward J. Kennedy, Johnny BA Tran, Brandon K. Griffin
  • Publication number: 20130191110
    Abstract: A computer program product is provided and includes a non-transitory tangible storage medium readable by a processing circuit and on which instructions are stored for execution by the processing circuit for performing a method. The method includes enabling retrieval of a keyboard pressed sequence of characters of a first type, permitting a re-selection of characters of a second type, which are associated with the keyboard pressed sequence of the characters of the first type and permitting modification of the keyboard pressed sequence of the characters of the first type to initiate a search for and retrieval of characters of the second type.
    Type: Application
    Filed: January 20, 2012
    Publication date: July 25, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lei Chen, Jenny S. Li, Wen Hao Wang
  • Publication number: 20130185050
    Abstract: Converting technical data from field oriented electronic data sources into natural language form is disclosed. An approach includes obtaining document data from an input document, wherein the document data is in a non-natural language form. The approach includes determining a data type of the document data from one of a plurality of data types defined in a detection and conversion database. The approach includes translating the document data to a natural language form based on the determined data type. The approach additionally includes outputting the translated document data in natural language form to an output data stream.
    Type: Application
    Filed: January 13, 2012
    Publication date: July 18, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: John J. BIRD, Doyle J. MCCOY