Patents by Inventor David A. Winarsky

David A. Winarsky has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230134970
    Abstract: Systems and processes for generating audio books from text are provided. An example process includes, at an electronic device having one or more processors and memory: receiving a text including at least a first subset and a second subset, wherein at least a portion of the first subset overlaps with at least a portion of the second subset; determining, based on the text, a prosody for a speech output, wherein the prosody is representative of a genre; determining a semantic meaning of the text; and generating, based on the prosody and the semantic meaning, the speech output of the text.
    Type: Application
    Filed: October 31, 2022
    Publication date: May 4, 2023
    Inventors: Ramya RASIPURAM, William BECKMAN, Ladan GOLIPOUR, David A. WINARSKY, Cheng-Chieh YEH, Weicheng ZHANG
  • Publication number: 20230051062
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example, a plurality of speech inputs is received from a first user. A voice model is obtained based on the plurality of speech inputs. A user input is received from the first user, the user input corresponding to a request to provide access to the voice model. The voice model is provided to a second electronic device.
    Type: Application
    Filed: October 25, 2022
    Publication date: February 16, 2023
    Inventors: Qiong HU, Jiangchuan LI, David A. WINARSKY
  • Patent number: 11508380
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example, a plurality of speech inputs is received from a first user. A voice model is obtained based on the plurality of speech inputs. A user input is received from the first user, the user input corresponding to a request to provide access to the voice model. The voice model is provided to a second electronic device.
    Type: Grant
    Filed: May 26, 2020
    Date of Patent: November 22, 2022
    Assignee: Apple Inc.
    Inventors: Qiong Hu, Jiangchuan Li, David A. Winarsky
  • Publication number: 20210375290
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example, a plurality of speech inputs is received from a first user. A voice model is obtained based on the plurality of speech inputs. A user input is received from the first user, the user input corresponding to a request to provide access to the voice model. The voice model is provided to a second electronic device.
    Type: Application
    Filed: May 26, 2020
    Publication date: December 2, 2021
    Inventors: Qiong HU, Jiangchuan LI, David A. WINARSKY
  • Patent number: 9934775
    Abstract: Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.
    Type: Grant
    Filed: September 15, 2016
    Date of Patent: April 3, 2018
    Assignee: Apple Inc.
    Inventors: Tuomo J. Raitio, Kishore Sunkeswari Prahallad, Alistair D. Conkie, Ladan Golipour, David A. Winarsky
  • Publication number: 20170345411
    Abstract: Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.
    Type: Application
    Filed: September 15, 2016
    Publication date: November 30, 2017
    Inventors: Tuomo J. RAITIO, Kishore Sunkeswari PRAHALLAD, Alistair D. CONKIE, Ladan GOLIPOUR, David A. WINARSKY
  • Patent number: 9386154
    Abstract: The present invention provides a system, method and software application for enabling a customer service agent to efficiently communicate with users of a communication device. When a user enters speech input into his communication device, the speech is converted to text, and the text is displayed to the customer service agent on the agent's computer screen. Alternately, the user's speech input is provided to the customer service agent in the form of an audio file. The agent types a response, and the agent's response is provided to the user on the user's communication device. The agent's response may be converted to speech and played to the user, and/or the agent's response may be displayed as text on the display screen of the user's communication device.
    Type: Grant
    Filed: December 21, 2007
    Date of Patent: July 5, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Assaf Baciu, Kevin M. Stone, David A. Winarsky
  • Patent number: 8929883
    Abstract: A system, method and software application track call failures on a wireless phone. A software application on the phone monitors outbound calls made on the phone to determine whether or not a call fails. In one embodiment, such monitoring is done only for a select period of time (e.g., the initial few seconds when the phone is attempting to connect to a call to the wireless communication carrier's network), and, in an alternate embodiment, such monitoring is done for the duration of the call. If a call fails, the software application stores information related to the call. Examples of such types of information include the time the call failed, the location of the phone at call failure, and identification information for the phone. The stored call information is then sent to the wireless communication carrier to enable the carrier to assess network quality.
    Type: Grant
    Filed: November 26, 2012
    Date of Patent: January 6, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Assaf Baciu, Hetal Pandya, Kevin Stone, David Winarsky
  • Patent number: 8824651
    Abstract: The present invention provides a system, method and software application for associating one or more phone numbers with an action on a phone and performing an action on the phone in response to the user initiating a call to one of such phone numbers. Certain phone numbers are associated with an action. The user's phone, or a server with which the user's phone communicates, maintains a table of the phone numbers associated with an action, and each phone number in the table is related to an action. Software on the user's phone listens for calls initiated by a user. When the user initiates a call, software on the phone “intercepts” the call, and the phone (or a server with which the phone communicates) determines whether the call is to a phone number that is in the table. If the phone number is in the table, then it is associated with an action, and the user's phone performs that action at a time specified by instructions for such action.
    Type: Grant
    Filed: January 15, 2013
    Date of Patent: September 2, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Carl Mikael Berner, Assaf Baciu, Sandeep M. Bhojwani, Jayadev Billa, Kevin M. Stone, David A. Winarsky
  • Publication number: 20130073286
    Abstract: Candidate interpretations resulting from application of speech recognition algorithms to spoken input are presented in a consolidated manner that reduces redundancy. A list of candidate interpretations is generated, and each candidate interpretation is subdivided into time-based portions, forming a grid. Those time-based portions that duplicate portions from other candidate interpretations are removed from the grid. A user interface is provided that presents the user with an opportunity to select among the candidate interpretations; the user interface is configured to present these alternatives without duplicate elements.
    Type: Application
    Filed: September 20, 2011
    Publication date: March 21, 2013
    Applicant: APPLE INC.
    Inventors: Marcello Bastea-Forte, David A. Winarsky
  • Patent number: 8363807
    Abstract: The present invention provides a system, method and software application for associating one or more phone numbers with an action on a phone and performing an action on the phone in response to the user initiating a call to one of such phone numbers. Certain phone numbers are associated with an action. The user's phone, or a server with which the user's phone communicates, maintains a table of the phone numbers associated with an action, and each phone number in the table is related to an action. Software on the user's phone listens for calls initiated by a user. When the user initiates a call, software on the phone “intercepts” the call, and the phone (or a server with which the phone communicates) determines whether the call is to a phone number that is in the table. If the phone number is in the table, then it is associated with an action, and the user's phone performs that action at a time specified by instructions for such action.
    Type: Grant
    Filed: September 27, 2007
    Date of Patent: January 29, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Assaf Baciu, Sandeep M. Bhojwani, Jayadev Billa, Carl Mikael Berner, Kevin M. Stone, David A. Winarsky
  • Patent number: 8331919
    Abstract: A system, method and software application track call failures on a wireless phone. A software application on the phone monitors outbound calls made on the phone to determine whether or not a call fails. In one embodiment, such monitoring is done only for a select period of time (e.g., the initial few seconds when the phone is attempting to connect to a call to the wireless communication carrier's network), and, in an alternate embodiment, such monitoring is done for the duration of the call. If a call fails, the software application stores information related to the call. Examples of such types of information include the time the call failed, the location of the phone at call failure, and identification information for the phone. The stored call information is then sent to the wireless communication carrier to enable the carrier to assess network quality.
    Type: Grant
    Filed: April 24, 2009
    Date of Patent: December 11, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Assaf Baciu, Hetal Pandya, Kevin M. Stone, David A. Winarsky
  • Publication number: 20090164214
    Abstract: The present invention provides a system, method and software application for enabling a customer service agent to efficiently communicate with users of a communication device. When a user enters speech input into his communication device, the speech is converted to text, and the text is displayed to the customer service agent on the agent's computer screen. Alternately, the user's speech input is provided to the customer service agent in the form of an audio file. The agent types a response, and the agent's response is provided to the user on the user's communication device. The agent's response may be converted to speech and played to the user, and/or the agent's response may be displayed as text on the display screen of the user's communication device.
    Type: Application
    Filed: December 21, 2007
    Publication date: June 25, 2009
    Inventors: Assaf Baciu, Kevin M. Stone, David A. Winarsky
  • Publication number: 20090046843
    Abstract: The present invention provides a system, method and software application for associating one or more phone numbers with an action on a phone and performing an action on the phone in response to the user initiating a call to one of such phone numbers. Certain phone numbers are associated with an action. The user's phone, or a server with which the user's phone communicates, maintains a table of the phone numbers associated with an action, and each phone number in the table is related to an action. Software on the user's phone listens for calls initiated by a user. When the user initiates a call, software on the phone “intercepts” the call, and the phone (or a server with which the phone communicates) determines whether the call is to a phone number that is in the table. If the phone number is in the table, then it is associated with an action, and the user's phone performs that action at a time specified by instructions for such action.
    Type: Application
    Filed: September 27, 2007
    Publication date: February 19, 2009
    Inventors: Assaf Baciu, Sandeep M. Bhojwani, Jayadev Billa, Carl Mikael Berner, Kevin M. Stone, David A. Winarsky