Handicap Aid Patents (Class 704/271)
  • Patent number: 9299358
    Abstract: A method for voice modification during a telephone call comprising receiving a source audio signal associated with at least one participant, wherein the source audio signal comprises a voice of the at least one participant, detecting a source dialect of the at least one participant, selecting a target dialect based on at least a characteristic of a target participant and creating a modulated audio signal based on the source audio signal, the source dialect, and the target dialect and transmitting the modulated audio signal to the target participant.
    Type: Grant
    Filed: August 7, 2013
    Date of Patent: March 29, 2016
    Assignee: Vonage America Inc.
    Inventor: Tzahi Efrati
  • Patent number: 9292764
    Abstract: A method for providing object information for a scene in a wearable computer is disclosed. In this method, an image of the scene is captured. Further, the method includes determining a current location of the wearable computer and a view direction of an image sensor of the wearable computer and extracting at least one feature from the image indicative of at least one object. Based on the current location, the view direction, and the at least one feature, information on the at least one object is determined. Then, the determined information is output.
    Type: Grant
    Filed: September 17, 2013
    Date of Patent: March 22, 2016
    Assignee: QUALCOMM Incorporated
    Inventors: Sungrack Yun, Kyu Woong Hwang, Jun-Cheol Cho, Taesu Kim, Minho Jin, Yongwoo Cho, Kang Kim
  • Patent number: 9280914
    Abstract: The present invention discloses a vision-aided hearing assisting device, which includes a display device, a microphone and a processing unit. The processing unit includes a receiving module, a message generating module and a display driving module. The processing unit is electrically connected to the display device and the microphone. The receiving module receives a surrounding sound signal, which is generated by the microphone. The message generating module analyzes the surrounding sound signal according to a present-scenario mode to generate a related message related with the surrounding sound signal. The display driving module drives the display device to display the related message.
    Type: Grant
    Filed: April 10, 2014
    Date of Patent: March 8, 2016
    Assignee: National Central University
    Inventors: Jia-Ching Wang, Chang-Hong Lin, Chih-Hao Shih
  • Patent number: 9218119
    Abstract: A computer device with a sensor subsystem for detecting off-surface objects, that carries out continued processing of the position and shape of objects detected in the vicinity of the device, associates these positions and shapes with predetermined gesture states, determines if the object is transitioning between gesture states and provides feedback based on the determined transition between the gesture states.
    Type: Grant
    Filed: March 25, 2011
    Date of Patent: December 22, 2015
    Assignee: BlackBerry Limited
    Inventors: Dan Gärdenfors, Karl-Anders Johansson, James Haliburton
  • Patent number: 9111545
    Abstract: The present invention relates to a hand-held communication aid and method that assists the deaf-dumb and visually impaired individuals to communicate with each other and with normal individuals. The method enables deaf-dumb and visually impaired individuals to communicate with each other and with normal individuals on remote communication means without any hardware improvization. The method enables face to face communication and remote communication aid for deaf-dumb and visually impaired individuals. This method requires no modifications in hand-held communication device used by normal individual.
    Type: Grant
    Filed: May 18, 2011
    Date of Patent: August 18, 2015
    Assignee: TATA CONSULTANCY SERVICES LIMITED
    Inventors: Charudatta Vitthal Jadhav, Bhushan Jagyasi
  • Patent number: 9057826
    Abstract: An optical apparatus includes an optical combiner, an image lens, and an external scene lens. The optical combiner has an eye-ward side and an external scene side and includes a partially reflective diffraction grating that is at least partially reflective to image light incident through the eye-ward side and at least partially transmissive to external scene light incident through the external scene side. A first mount is positioned to hold the image lens in an optical path of the image light to apply a first corrective prescription to the image light. A second mount is positioned to hold an external scene lens over the external scene side of the optical combiner to apply a second corrective prescription to the external scene light. The optical combiner combines the image light with the scene light to form a combined image that is corrected according to the first and second corrective prescriptions.
    Type: Grant
    Filed: January 31, 2013
    Date of Patent: June 16, 2015
    Assignee: Google Inc.
    Inventors: Anurag Gupta, Greg E. Priest-Dorman, Bernard C. Kress
  • Patent number: 9043204
    Abstract: Some embodiments of the inventive subject matter include a method for detecting speech loss and supplying appropriate recollection data to the user. Such embodiments include detecting a speech stream from a user, converting the speech stream to text, storing the text, detecting an interruption to the speech stream, wherein the interruption to the speech stream indicates speech loss by the user, searching a catalog using the text as a search parameter to find relevant catalog data and, presenting the relevant catalog data to remind the user about the speech stream.
    Type: Grant
    Filed: September 12, 2012
    Date of Patent: May 26, 2015
    Assignee: International Business Machines Corporation
    Inventor: Scott H. Berens
  • Patent number: 9026237
    Abstract: A system for generating audio impressions of data for a visually-impaired user. The system receives data that is displayable by a chart. The data comprises a plurality of values. The system generates an audio impression of the received data. The audio impression includes a first portion and a second portion. The first portion is based upon at least a first value of the received data. The second portion is based upon at least a second value of the received data. An audible difference between the first portion and the second portion reflects the magnitude of a difference between the first value and the second value.
    Type: Grant
    Filed: September 21, 2012
    Date of Patent: May 5, 2015
    Assignee: Oracle International Corporation
    Inventor: Lory D. Molesky
  • Publication number: 20150119635
    Abstract: A system, including a first prosthetic device configured to evoke a hearing percept based on a first ambient sound and a second non-invasive device configured to stimulate skin based on a second ambient sound generated by a voice.
    Type: Application
    Filed: October 25, 2013
    Publication date: April 30, 2015
    Inventors: Johan Gustafsson, Martin Hillbratt, Kristian Asnes, Marcus Andersson
  • Patent number: 8996387
    Abstract: For clearing transaction data selected for a processing, there is generated in a portable data carrier (1) a transaction acoustic signal (003; 103; 203) (S007; S107; S207) upon whose acoustic reproduction by an end device (10) at least transaction data selected for the processing are reproduced superimposed acoustically with a melody specific to a user of the data carrier (1) (S009; S109; S209). The generated transaction acoustic signal (003; 103; 203) is electronically transferred to an end device (10) (S108; S208), which processes the selected transaction data (S011; S121; S216) only when the user of the data carrier (1) confirms vis-à-vis the end device (10) an at least partial match both of the acoustically reproduced melody with the user-specific melody and of the acoustically reproduced transaction data with the selected transaction data (S010; S110, S116; S210).
    Type: Grant
    Filed: September 8, 2009
    Date of Patent: March 31, 2015
    Assignee: Giesecke & Devrient GmbH
    Inventors: Thomas Stocker, Michael Baldischweiler
  • Patent number: 8977550
    Abstract: Part units of speech information are arranged in a predetermined order to generate a sentence unit of a speech information set. To each of a plurality of speech part units of the speech information, an attribute of “interrupt possible after reproduction” with which reproduction of priority interrupt information can be started after the speech part unit of the speech information is reproduced or another attribute of “interrupt impossible after reproduction” with which reproduction of the priority interrupt information cannot be started even after the speech part unit of the speech information is reproduced is set. When the priority interrupt information having a high priority rank than the speech information set being currently reproduced is inputted, if the attribute of the speech information being reproduced at the point in time is “interrupt impossible after reproduction,” then the priority interrupt information is reproduced after the speech information is reproduced.
    Type: Grant
    Filed: May 6, 2011
    Date of Patent: March 10, 2015
    Assignee: Honda Motor Co., Ltd.
    Inventor: Tokujiro Kizaki
  • Patent number: 8965772
    Abstract: Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and displaying an icon representing the input state of the speech command. In typical embodiments, the icon representing a speech command type and the icon representing the input state of the speech command also includes attributes of a single icon. Typical embodiments include accepting from a user a speech command of the speech command type, changing the input state of the speech command, and displaying another icon representing the changed input state of the speech command. Typical embodiments also include displaying the text of the speech command in association with the icon representing the speech command type.
    Type: Grant
    Filed: March 20, 2014
    Date of Patent: February 24, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Charles W. Cross, Jr., Michael C. Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
  • Patent number: 8954334
    Abstract: A voice-activated pulser can trigger an oscilloscope or a meter, upon a simple voice command, thereby enabling hands-free signal measurements. The pulser can also be used to control the circuit under test, activating it or changing parameters, all under voice control. The pulser includes numerous switch-selectable output modes that allow users to generate complex, tightly-controlled diagnostic sequences, all activated upon a voice command and hands-free. The invention includes a fast, robust command-interpretation protocol that completely eliminates the expense and complexity of word recognition. Visual indicators display the device status and various operating modes, and also confirm each output pulse. The device receives voice commands directly through an internal microphone, or through a detachable headset, and confirms each command with an acoustical signal in the headset.
    Type: Grant
    Filed: October 15, 2011
    Date of Patent: February 10, 2015
    Assignee: Zanavox
    Inventor: David Edward Newman
  • Patent number: 8949123
    Abstract: The voice conversion method of a display apparatus includes: in response to the receipt of a first video frame, detecting one or more entities from the first video frame; in response to the selection of one of the detected entities, storing the selected entity; in response to the selection of one of a plurality of previously-stored voice samples, storing the selected voice sample in connection with the selected entity; and in response to the receipt of a second video frame including the selected entity, changing a voice of the selected entity based on the selected voice sample and outputting the changed voice.
    Type: Grant
    Filed: April 11, 2012
    Date of Patent: February 3, 2015
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Aditi Garg, Kasthuri Jayachand Yadlapalli
  • Patent number: 8949128
    Abstract: Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired speech output. The synthesis system selects one or more audio recordings corresponding to one or more portions of the text input. In one aspect, the synthesis system selects from audio recordings provided by a developer of the speech-enabled application. In another aspect, the synthesis system selects an audio recording of a speaker speaking a plurality of words. The synthesis system forms a speech output including the one or more selected audio recordings and provides the speech output for the speech-enabled application.
    Type: Grant
    Filed: February 12, 2010
    Date of Patent: February 3, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Darren C. Meyer, Corinne Bos-Plachez, Martine Marguerite Staessen
  • Patent number: 8949129
    Abstract: A method and apparatus are provided for processing a set of communicated signals associated with a set of muscles, such as the muscles near the larynx of the person, or any other muscles the person use to achieve a desired response. The method includes the steps of attaching a single integrated sensor, for example, near the throat of the person proximate to the larynx and detecting an electrical signal through the sensor. The method further includes the steps of extracting features from the detected electrical signal and continuously transforming them into speech sounds without the need for further modulation. The method also includes comparing the extracted features to a set of prototype features and selecting a prototype feature of the set of prototype features providing a smallest relative difference.
    Type: Grant
    Filed: August 12, 2013
    Date of Patent: February 3, 2015
    Assignee: Ambient Corporation
    Inventors: Michael Callahan, Thomas Coleman
  • Patent number: 8938382
    Abstract: An item of information (212) is transmitted to a distal computer (220), translated to a different sense modality and/or language (222), and in substantially real time, and the translation (222) is transmitted back to the location (211) from which the item was sent. The device sending the item is preferably a wireless device, and more preferably a cellular or other telephone (210). The device receiving the translation is also preferably a wireless device, and more preferably a cellular or other telephone, and may advantageously be the same device as the sending device. The item of information (212) preferably comprises a sentence of human of speech having at least ten words, and the translation is a written expression of the sentence. All of the steps of transmitting the item of information, executing the program code, and transmitting the translated information preferably occurs in less than 60 seconds of elapsed time.
    Type: Grant
    Filed: March 21, 2012
    Date of Patent: January 20, 2015
    Assignee: Ulloa Research Limited Liability Company
    Inventor: Robert D. Fish
  • Patent number: 8938394
    Abstract: A computing device includes at least one processor and at least one module, operable by the at least one processor, to determine a context of the computing device, the context including an indication of at least one of an application executing at the computing device and a location of the computing device and determine, based at least in part on the context, one or more contextual audio triggers usable to initiate interaction with the computing device, each of the one or more contextual audio triggers being associated with a respective operation of the computing device. The at least one module is further operable to receive audio data, and responsive to determining that a portion of the audio data corresponds to a particular contextual audio trigger from the one or more contextual audio triggers, perform the respective operation associated with the particular contextual audio trigger.
    Type: Grant
    Filed: January 9, 2014
    Date of Patent: January 20, 2015
    Assignee: Google Inc.
    Inventors: Alexander Faaborg, Daniel Marc Gatan Shiplacoff
  • Patent number: 8924218
    Abstract: An automated personal assistance system employing artificial intelligence technology that includes speech recognition and synthesis, situational awareness, pattern and behavioral recognition, and the ability to learn from the environment. Embodiments of the system include environmental and occupant sensors and environmental actuators interfaced to an assistance controller having the artificial intelligence technology incorporated therein to control the environment of the system. An embodiment of the invention is implemented as a vehicle which reacts to voice command for movement and operation of the vehicle and detects objects, obstructions, and distances. This invention provides the ability to monitor for the safety of operation and modify dangerous maneuvers as well as to learn locations in the environment and to automatically find its way to them. The system may also incorporate communication capability to convey patterns of environmental and occupant parameters and to a monitoring center.
    Type: Grant
    Filed: November 29, 2011
    Date of Patent: December 30, 2014
    Inventors: Greg L. Corpier, Katie J. Boyer
  • Patent number: 8920174
    Abstract: An electro-tactile display includes an electrode substrate provided with a plurality of stimulation electrodes, a conductive gel layer positioned between the stimulation electrodes and the skin of a wearer, a switching circuit section electrically connected to the stimulation electrodes, a stimulation pattern generating section electrically connected to the switching circuit, and means for alleviating a sensation experienced by the wearer as a result of the stimulation electrodes. In one aspect, the means for alleviating a sensation is configured from the conductive gel layer. The conductive gel layer has a resistance value equivalent to that of the horny layer of the skin. In another aspect, the means for alleviating a sensation is configured from the stimulation determination means and the threshold value adjustment means.
    Type: Grant
    Filed: December 7, 2006
    Date of Patent: December 30, 2014
    Assignees: The University of Tokyo, Eye Plus Plus, Inc.
    Inventors: Susumu Tachi, Hiroyuki Kajimoto, Yonezo Kanno
  • Publication number: 20140379352
    Abstract: Exemplary embodiments include an assistive device to facilitate social interactions in autistic individuals by identifying emotions using a voice-detecting machine learning algorithm that extracts emotion content from an audio sample input and outputs the emotional content to a user through a device. This device may be a portable, concealable, real-time and automatic device that may receive and process an audio input. The audio input may be analyzed using a machine learning algorithm. The device may output the closest emotional match to the autistic user. The output may be tactile in nature such as a vibration pattern that is different for different identified emotions.
    Type: Application
    Filed: June 16, 2014
    Publication date: December 25, 2014
    Inventors: Suhas Gondi, Andrea Shao-Yin Li, Maxinder S. Kanwal, Corwin de Boor, Muthuraman Chidambaram, Anand Prasanna, Jae Young Chang, Benjamin L. Hsu
  • Patent number: 8917822
    Abstract: A device and method for providing captioned services to an assisted user using a captioned device linkable via a first communication link to a hearing user's device where the method includes the steps of, at a relay, receiving a request for captioning service from the captioned device on a second communication link, in response to the request, setting up the captioning service at the relay including receiving hearing user voice signals from the captioned device, providing the voice signals to a call assistant to transcribe into text and transmitting the text back to the captioned device to display, wherein the step of receiving a request may be prior to establishment the first communication link and wherein the step of receiving a request may be subsequent to establishment of the first communication link.
    Type: Grant
    Filed: June 9, 2014
    Date of Patent: December 23, 2014
    Assignee: Ultratec, Inc.
    Inventors: Robert M Engelke, Kevin R Colwell
  • Patent number: 8918197
    Abstract: As the possible variations of “Hearing Thresholds”, “Hearing Loudness bandwidths” and “Voice Intonation” characteristics of people are finite, it is proposed to set a Database of these characteristics, where the data elements fully describe the Hearing and Talking characteristics of anyone, while many have the same characteristics. Thus any voice communication between two parties may be optimized by correcting the intensities of the call in the spectral domain, differently for each party and each ear. The optimizations are automatic given the “codes” of the parties and have a minimal latency. The system may be implemented either centrally in the world-wide-web or at the edges, in cellular phones, landline phones, VoIP, VoIM and in the audio parts of entertainment devices.
    Type: Grant
    Filed: June 13, 2012
    Date of Patent: December 23, 2014
    Inventor: Avraham Suhami
  • Patent number: 8914291
    Abstract: Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output, and inputs the text input to a speech synthesis system. The synthesis system generates an audio speech output corresponding to at least a portion of the text input, with at least one portion carrying contrastive stress, and provides the audio speech output for the speech-enabled application. In another aspect, a speech-enabled application inputs a plurality of text strings, each corresponding to a portion of a desired speech output, to a software module for rendering contrastive stress. The software module identifies a plurality of audio recordings that render at least one portion of at least one of the text strings as speech carrying contrastive stress. The speech-enabled application generates an audio speech output corresponding to the desired speech output using the audio recordings.
    Type: Grant
    Filed: September 24, 2013
    Date of Patent: December 16, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Darren C. Meyer, Stephen R. Springer
  • Patent number: 8909538
    Abstract: Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface by providing in combination with at least one user prompt seeking a voice response, an enhanced user keyword prompt intended to facilitate the user selecting a keyword to speak in response to the user prompt. The enhanced keyword prompts may be the same words as those a user can speak as a reply to the user prompt but presented using a different audio presentation method, e.g., speech rate, audio level, or speaker voice, than used for the user prompt. In some cases, the user keyword prompts are different words from the expected user response keywords, or portions of words, e.g., truncated versions of keywords.
    Type: Grant
    Filed: November 11, 2013
    Date of Patent: December 9, 2014
    Assignee: Verizon Patent and Licensing Inc.
    Inventor: James Mark Kondziela
  • Patent number: 8909523
    Abstract: A method determines a bias reduced noise and interference estimation in a binaural microphone configuration with a right and a left microphone signal at a time-frame with a target speaker active. The method includes a determination of the auto power spectral density estimate of the common noise formed of noise and interference components of the right and left microphone signals and a modification of the auto power spectral density estimate of the common noise by using an estimate of the magnitude squared coherence of the noise and interference components contained in the right and left microphone signals determined at a time frame without a target speaker active. An acoustic signal processing system and a hearing aid implement the method for determining the bias reduced noise and interference estimation. The noise reduction performance of speech enhancement algorithms is improved by the invention. Further, distortions of the target speech signal and residual noise and interference components are reduced.
    Type: Grant
    Filed: June 7, 2011
    Date of Patent: December 9, 2014
    Assignee: Siemens Medical Instruments Pte. Ltd.
    Inventors: Walter Kellermann, Klaus Reindl, Yuanhang Zheng
  • Patent number: 8908838
    Abstract: A system and method for providing captioned services comprising a relay, an assisted user's captioned device including a processor programmed to perform the steps of establishing a first communication link between the captioned device and a hearing person's device, receiving voice signals from the hearing person via the first communication link, receive an indication that an activator has been activated to invoke a captioning service and in response transmitting the hearing user's voice signals received at the captioned device to a relay via a second communication link, receiving text back corresponding to the hearing user's voice signals from the relay and displaying the text wherein the assisted user can invoke the captioning service either prior to or after the first communication link is established.
    Type: Grant
    Filed: June 9, 2014
    Date of Patent: December 9, 2014
    Assignee: Ultratec, Inc.
    Inventors: Robert M Engelke, Kevin R Colwell
  • Publication number: 20140358551
    Abstract: A speech aid system includes a tube for mounting at a tracheostomy of a user, a voice parameter acquiring device mounted to the tube and generating a voice parameter signal according to airflow applied within the tube resulting from attempt by the user to speak, a processor generating an audio signal corresponding to the voice parameter signal, and a sound generator for mounting in an oral cavity of the user. The sound generator produces a substitute glottal sound corresponding to the audio signal.
    Type: Application
    Filed: June 3, 2014
    Publication date: December 4, 2014
    Inventors: Ching-Feng LIU, Hsiao-Han CHEN
  • Patent number: 8888494
    Abstract: One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles and a set of information associated with each role in the set of roles. An active role in the set of roles that is associated with a given user is identified based on the analyzing. At least a portion of the manuscript is presented to the given user via a user interface. The portion includes at least a subset of information in the set of information. Information within the set of information that is associated with the active role is presented in a visually different manner than information within the set of information that is associated with a non-active role, which is a role that is associated with a user other than the given user.
    Type: Grant
    Filed: June 27, 2011
    Date of Patent: November 18, 2014
    Inventor: Randall Lee Threewits
  • Patent number: 8892232
    Abstract: The invention describes the proprietary activities, services and devices provided to a networked community of Hearing impaired people, that help improve wired, wireless and direct voice communications.
    Type: Grant
    Filed: November 20, 2012
    Date of Patent: November 18, 2014
    Inventor: Avraham Suhami
  • Patent number: 8868426
    Abstract: The amount of speech output to a blind or low-vision user using a screen reader application is automatically adjusted based on how the user navigates to a control in a graphic user interface. Navigation by mouse presumes the user has greater knowledge of the identity of the control than navigation by tab keystroke which is more indicative of a user searching for a control. In addition, accelerator keystrokes indicate a higher level of specificity to set focus on a control and thus less verbosity is required to sufficiently inform the screen reader user.
    Type: Grant
    Filed: August 23, 2012
    Date of Patent: October 21, 2014
    Assignee: Freedom Scientific, Inc.
    Inventors: Garald Lee Voorhees, Glen Gordon, Eric Damery
  • Patent number: 8868373
    Abstract: Disclosed are virtual reality systems, in particular immersive virtual reality systems, their parts, construction and use. The systems and/or parts thereof may be used by adults or children, and may be adapted to support, often within a single device, a large range of users of different sizes and medical condition. Users with physical disabilities have difficulties using existing immersive technologies such as those using accessories like head-mounted displays and data gloves. Such users are provided with immersive virtual reality outputs that allow them to see virtual representations of their body parts which appear in a correct spatial position relative to the users' viewpoint.
    Type: Grant
    Filed: August 19, 2009
    Date of Patent: October 21, 2014
    Assignee: Universitat Zurich Prorektorat MNW
    Inventors: Kynan Eng, Pawel Pyk, Edith Chevrier, Lisa Holper, Daniel Kiper
  • Patent number: 8855322
    Abstract: An original loudness level of an audio signal is maintained for a mobile device while maintaining sound quality as good as possible and protecting the loudspeaker used in the mobile device. The loudness of an audio (e.g., speech) signal may be maximized while controlling the excursion of the diaphragm of the loudspeaker (in a mobile device) to stay within the allowed range. In an implementation, the peak excursion is predicted (e.g., estimated) using the input signal and an excursion transfer function. The signal may then be modified to limit the excursion and to maximize loudness.
    Type: Grant
    Filed: August 9, 2011
    Date of Patent: October 7, 2014
    Assignee: QUALCOMM Incorporated
    Inventors: Sang-Uk Ryu, Jongwon Shin, Roy Silverstein, Andre Gustavo P. Schevciw, Pei Xiang
  • Patent number: 8843374
    Abstract: A link table is generated, voice information is associated by dot patterns, and then, voice information associated with the dot pattern is reproduced from a speaker when the dot pattern is read by means of a scanner. In this manner, the dot pattern is printed on a surface of a material such as a picture book or a card, making it possible to play back voice information corresponding to a pattern or a story of a picture book and to play back voice information corresponding to a character described on the card. In addition, by means of a link table, new voice information can be associated with, dissociated from, or changed to, a new dot pattern.
    Type: Grant
    Filed: May 19, 2006
    Date of Patent: September 23, 2014
    Inventor: Kenji Yoshida
  • Patent number: 8843368
    Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.
    Type: Grant
    Filed: August 17, 2009
    Date of Patent: September 23, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Yeon-Jun Kim, David C. Gibbon, Horst Schroeter
  • Patent number: 8838456
    Abstract: An image processing apparatus including: image processor which processes broadcasting signal, to display image based on processed broadcasting signal; communication unit which is connected to a server; a voice input unit which receives a user's speech; a voice processor which processes a performance of a preset corresponding operation according to a voice command corresponding to the speech; and a controller which processes the voice command corresponding to the speech through one of the voice processor and the server if the speech is input through the voice input unit. If the voice command includes a keyword relating to a call sign of a broadcasting channel, the controller controls one of the voice processor and the server to select a recommended call sign corresponding to the keyword according to a predetermined selection condition, and performs a corresponding operation under the voice command with respect to the broadcasting channel of the recommended call sign.
    Type: Grant
    Filed: May 14, 2013
    Date of Patent: September 16, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Joo-yeong Lee, Sang-shin Park
  • Patent number: 8838454
    Abstract: A method of processing a call in a voice-command platform includes a step of transferring the call from the voice-command platform to a second voice-command platform. The method continues with the step of transmitting, either directly or indirectly, grammar information from the voice command platform to the second voice-command platform for use by a voice command application executing in the second voice-command platform in processing the call. The grammar information could be logic defining application-level grammar or system-level grammar. Alternatively, the grammar information could be a network address (e.g., URI or URL) where the grammar is stored in a file, e.g., a VXML document. The features of this invention enhance the user experience by preserving and using grammars used initially in the first voice command platform in other, downstream, voice command platforms.
    Type: Grant
    Filed: December 10, 2004
    Date of Patent: September 16, 2014
    Assignee: Sprint Spectrum L.P.
    Inventor: Balaji S. Thenthiruperai
  • Patent number: 8826137
    Abstract: A screen reader software product for low-vision users, the software having a reader module collecting textual and non-textual display information generated by a web browser or word processor. Font styling, interface layout information and the like are communicated to the end user by sounds broadcast simultaneously rather than serially with the synthesized speech to improve the speed and efficiency in which information may be digested by the end user.
    Type: Grant
    Filed: August 12, 2004
    Date of Patent: September 2, 2014
    Assignee: Freedom Scientific, Inc.
    Inventors: Christian D. Hofstader, Glen Gordon, Eric Damery, Ralph Ocampo, David Baker, Joseph K. Stephen
  • Patent number: 8825486
    Abstract: Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output, and inputs the text input to a speech synthesis system. The synthesis system generates an audio speech output corresponding to at least a portion of the text input, with at least one portion carrying contrastive stress, and provides the audio speech output for the speech-enabled application. In another aspect, a speech-enabled application inputs a plurality of text strings, each corresponding to a portion of a desired speech output, to a software module for rendering contrastive stress. The software module identifies a plurality of audio recordings that render at least one portion of at least one of the text strings as speech carrying contrastive stress. The speech-enabled application generates an audio speech output corresponding to the desired speech output using the audio recordings.
    Type: Grant
    Filed: January 22, 2014
    Date of Patent: September 2, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Darren C. Meyer, Stephen R. Springer
  • Patent number: 8825491
    Abstract: An auditory user interactive interface to an application program being installed in the computer controlled system. A routine in an object, in an application program being installed in the computer controlled system for providing an auditory user interface to the program in combination with auditory means for offering the user of the computer controlled system the auditory user interface during installation of said application program, and responsive to the selection of the auditory interface provides the auditory user interface during said installation of the application program.
    Type: Grant
    Filed: September 30, 2013
    Date of Patent: September 2, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Peter Thomas Brunet, Anh Quy Lu, Mark Edward Nosewicz, Lawrence Frank Weiss
  • Patent number: 8805685
    Abstract: Disclosed herein are systems, methods, and tangible computer readable-media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrate little variance over time or are the same, and verifying the plurality of speech samples if the plurality of speech samples demonstrates sufficient variance over time. One embodiment further adds that each of the plurality of speech samples is collected at different times or in different contexts. In other embodiments, variance is based on a pre-determined threshold or the threshold for variance is adjusted based on a need for authentication certainty. In another embodiment, if the initial comparison is inconclusive, additional speech samples are received.
    Type: Grant
    Filed: August 5, 2013
    Date of Patent: August 12, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventor: Horst J. Schroeter
  • Patent number: 8803939
    Abstract: A method and apparatus for realizing videophone are provided in the present invention. The method includes setting different videophone modes; and selecting the videophone mode for a user to perform a video conversation by using the selected videophone mode. The apparatus includes a setting module and a control module which is connected with the setting module. With the method and apparatus for realizing videophone according to the present invention, the user can select a normal conversation or a preset audio/video mode according to requirements in the process of the videophone conversation so as to achieve different video effects.
    Type: Grant
    Filed: October 19, 2010
    Date of Patent: August 12, 2014
    Assignee: ZTE Corporation
    Inventor: Tao Xue
  • Patent number: 8787531
    Abstract: Instant messaging (IM) is provided between a TDD/TTY user and an entity. The user may use a TDD device to initiate a call with the entity. One or more converters may convert a TDD message from the user's device to IM, which is then provided to a recipient of the call, such as a representative of a company. The converter(s) may also convert IM from the representative into a TDD message that may then be provided to the user on the TDD device.
    Type: Grant
    Filed: December 10, 2012
    Date of Patent: July 22, 2014
    Assignee: United Services Automobile Association (USAA)
    Inventor: Dena L. Smith
  • Publication number: 20140195245
    Abstract: A vocal fold movement translation device includes a reverse motion linkage configured to interface with a first vocal fold and a second vocal fold. The reverse motion linkage includes a mechanical component configured to move the first vocal fold in a first movement direction in response to movement of the second vocal fold in a second movement direction that is opposite the first movement direction. The reverse motion linkage is movable between a first configuration corresponding to an abducted position of the first and second vocal folds and a second configuration corresponding to an adducted position of the first and second vocal folds.
    Type: Application
    Filed: January 9, 2013
    Publication date: July 10, 2014
    Inventors: Derek Roe Eller, Erik M. Meyer
  • Patent number: 8775180
    Abstract: Apparatus and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice interaction. In one aspect of the invention, a communications system includes a user interface, a communications network, and a call center having an automatic speech recognition component. In other aspects of the invention, a script compliance method includes the steps of conducting a voice interaction between an agent and a client and evaluating the voice interaction with an automatic speech recognition component adapted to analyze the voice interaction and determine whether the agent has adequately followed the script. In yet still further aspects of the invention, the duration of a given interaction can be analyzed, either apart from or in combination with the script compliance analysis above, to seek to identify instances of agent non-compliance, of fraud, or of quality-analysis issues.
    Type: Grant
    Filed: November 26, 2012
    Date of Patent: July 8, 2014
    Assignee: West Corporation
    Inventors: Mark J. Pettay, Fonda J. Narke
  • Patent number: 8743388
    Abstract: A method for controlling a peripheral device includes receiving input from a user at a workstation adapted to the user, determining whether the received input can be valid, generating a job ticket from the valid input, sending the job ticket to the peripheral device and receiving an identifier representing the job ticket from the peripheral device.
    Type: Grant
    Filed: October 31, 2006
    Date of Patent: June 3, 2014
    Assignee: Lexmark International, Inc.
    Inventors: Mohamed Nooman Ahmed, Amanda Kay Bridges, Stuart Willard Daniel, William James Gardner Flowers, Charles Edward Grieshaber, Dennis Herbert Hasselbring, Michael Earl Lhamon, Chad Eugene McQuillen, Michael Ray Timperman
  • Patent number: 8744852
    Abstract: A spoken interface is described for assisting a visually impaired user to obtain audible information and interact with elements displayed on a display screen. The spoken interface also enables access and control of other elements that are hidden by other windows. The interface receives user input data representing user inputs received by an input device and uses a movable selector to select an element of an application. The element selected by the selector may be either an editing type element or non-editing type element. The interface provides audio information regarding the selected editing or non-editing element and enables interaction with the selected element.
    Type: Grant
    Filed: December 20, 2006
    Date of Patent: June 3, 2014
    Assignee: Apple Inc.
    Inventors: Eric T. Seymour, Richard W. Fabrick, II, Patti P. Yeh, John O. Louch
  • Publication number: 20140136209
    Abstract: A voice-assisted biomedical measurement apparatus is revealed. The voice-assisted biomedical measurement apparatus allows users to get measurement results of biological signals and other assistant information in aural form. The voice-assisted biomedical measurement apparatus consists of a sensing unit, a control unit, a voice module and a speaker. The voice-assisted biomedical measurement apparatus further includes a display unit, an operation unit, a memory unit and a data transmission unit. The voice-assisted biomedical measurement apparatus features on that the voice module combines at least one first voice data to form a sentence according to at least one first grammar data when a first control signal is sent from the control unit to the voice module. Moreover, the voice-assisted biomedical measurement apparatus gets at least one second voice data and at least one second grammar data from an external data system through a data transmission unit.
    Type: Application
    Filed: May 16, 2013
    Publication date: May 15, 2014
    Applicant: HEALTH & LIFE CO., LTD.
    Inventors: MENG-YI LIN, SHAO-HUNG LEE
  • Patent number: 8719034
    Abstract: Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and displaying an icon representing the input state of the speech command. In typical embodiments, the icon representing a speech command type and the icon representing the input state of the speech command also includes attributes of a single icon. Typical embodiments include accepting from a user a speech command of the speech command type, changing the input state of the speech command, and displaying another icon representing the changed input state of the speech command. Typical embodiments also include displaying the text of the speech command in association with the icon representing the speech command type.
    Type: Grant
    Filed: September 13, 2005
    Date of Patent: May 6, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Charles W. Cross, Jr., Michael Charles Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
  • Patent number: 8712780
    Abstract: A picture based communication system and mechanisms of implementation thereof allowing for rapid translation of picture based input into words or sentences of a previously chosen output language. Communication systems may be incorporated on PCs, mobile devices or may be a software running on a remote system which allows for language-independent messages to be constructed, which can be de-constructed into any language on the receiver's side. Mechanisms of implementation would also be of assistance in allowing people with language difficulties, dyslexia or illiteracy to communicate effectively.
    Type: Grant
    Filed: December 8, 2011
    Date of Patent: April 29, 2014
    Assignee: Invention Labs Engineering Products Pvt. Ltd.
    Inventor: Ajit Narayanan