Patents by Inventor Ankur Aher

Ankur Aher has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240153483
    Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
    Type: Application
    Filed: November 6, 2023
    Publication date: May 9, 2024
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Publication number: 20240153492
    Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
    Type: Application
    Filed: November 8, 2023
    Publication date: May 9, 2024
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Publication number: 20240040210
    Abstract: Methods and systems are described for providing content, such as a movie, with dialogue including a quotation that was input. For example, using a voice search a viewer may input a quotation famous from a movie to find the original fil and related content. The methods and systems use a quotation engine in a digital device to receive an input including the quotation and access a plurality of content items that include dialogue. The quotation engine identifies a subset of content items that include dialogue similar to the input quotation. The quotation engine accesses metadata of each of the subset of content, ranks the subset based on predetermined criteria and the metadata, and provides the ranked subset of the plurality of content items for consumption. The quotation engine may use a graphical user interface to identify the earliest release, trending content, or the program best known for the quote.
    Type: Application
    Filed: June 16, 2023
    Publication date: February 1, 2024
    Inventors: Ankur Aher, Nikhil Gabhane, Raman Gupta, Aman Puniyani
  • Patent number: 11889167
    Abstract: Systems and methods are provided for presenting an interactive content item matching a user-selected category to a user for a desired duration. A user selects a category and selects a first interactive content item on a media system. The system calculates a total duration of a storyline from the selected interactive content item that matches the selected category (e.g., a genre “comedy”) and compares the calculated duration to a desired predetermined duration for which the user wishes to watch the selected show. If the system determines, for instance, that the total duration of the selected storyline is less than the predetermined duration, the system identifies scenes from another show and interleaves them with scenes from the first interactive content item to generate a combined interactive content item that satisfies the user viewing preferences.
    Type: Grant
    Filed: September 3, 2021
    Date of Patent: January 30, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Sandeep Jangra, Aman Puniyani, Mohammed Yasir
  • Publication number: 20240005923
    Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
    Type: Application
    Filed: September 14, 2023
    Publication date: January 4, 2024
    Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
  • Patent number: 11854529
    Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
    Type: Grant
    Filed: November 10, 2022
    Date of Patent: December 26, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Patent number: 11842721
    Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
    Type: Grant
    Filed: August 5, 2022
    Date of Patent: December 12, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Patent number: 11790915
    Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
    Type: Grant
    Filed: March 7, 2023
    Date of Patent: October 17, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
  • Publication number: 20230267272
    Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user’s conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user’s request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user’s request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.
    Type: Application
    Filed: April 26, 2023
    Publication date: August 24, 2023
    Inventors: Ankur Aher, Susanto Sen
  • Publication number: 20230260514
    Abstract: The system provides a voice command recommendation to a user to avoid a non-voice command. The system determines a command that is expected to be received, and generates a voice command recommendation that corresponds to the predicted command. The predicted command can be based on the user's behavior, a plurality of users' behavior, environmental circumstances such as a phone call ring, or a combination thereof. The system may access one or more databases to determine the predicted command. The voice command recommendation may include a displayed notification that describes the recommended voice command, and exemplary voice inputs that are recognized. The system also activates an audio interface, such as a microphone, that is configured to receive a voice input. If the system receives a recognizable voice input at the audio interface that corresponds to the recommendation, the system performs the predicted command in response to receiving the voice input.
    Type: Application
    Filed: April 26, 2023
    Publication date: August 17, 2023
    Inventors: Jeffry Copps Robert Jose, Ankur Aher
  • Patent number: 11722749
    Abstract: Methods and systems are described for providing content, such as a movie, with dialogue including a quotation that was input. For example, using a voice search a viewer may input a quotation famous from a movie to find the original fil and related content. The methods and systems use a quotation engine in a digital device to receive an input including the quotation and access a plurality of content items that include dialogue. The quotation engine identifies a subset of content items that include dialogue similar to the input quotation. The quotation engine accesses metadata of each of the subset of content, ranks the subset based on predetermined criteria and the metadata, and provides the ranked subset of the plurality of content items for consumption. The quotation engine may use a graphical user interface to identify the earliest release, trending content, or the program best known for the quote.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: August 8, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Nikhil Gabhane, Raman Gupta, Aman Puniyani
  • Publication number: 20230206920
    Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
    Type: Application
    Filed: March 7, 2023
    Publication date: June 29, 2023
    Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
  • Patent number: 11676586
    Abstract: The system provides a voice command recommendation to a user to avoid a non-voice command. The system determines a command that is expected to be received, and generates a voice command recommendation that corresponds to the predicted command. The predicted command can be based on the user's behavior, a plurality of users' behavior, environmental circumstances such as a phone call ring, or a combination thereof. The system may access one or more databases to determine the predicted command. The voice command recommendation may include a displayed notification that describes the recommended voice command, and exemplary voice inputs that are recognized. The system also activates an audio interface, such as a microphone, that is configured to receive a voice input. If the system receives a recognizable voice input at the audio interface that corresponds to the recommendation, the system performs the predicted command in response to receiving the voice input.
    Type: Grant
    Filed: December 10, 2019
    Date of Patent: June 13, 2023
    Assignee: ROVI GUIDES, INC.
    Inventors: Jeffry Copps Robert Jose, Ankur Aher
  • Patent number: 11669685
    Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user's conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user's request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user's request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.
    Type: Grant
    Filed: October 9, 2020
    Date of Patent: June 6, 2023
    Assignee: ROVI GUIDES, INC.
    Inventors: Ankur Aher, Susanto Sen
  • Publication number: 20230146333
    Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
    Type: Application
    Filed: November 10, 2022
    Publication date: May 11, 2023
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Publication number: 20230140273
    Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
    Type: Application
    Filed: August 5, 2022
    Publication date: May 4, 2023
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Patent number: 11626113
    Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
    Type: Grant
    Filed: August 26, 2021
    Date of Patent: April 11, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
  • Patent number: 11527234
    Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
    Type: Grant
    Filed: October 1, 2019
    Date of Patent: December 13, 2022
    Assignee: ROVI GUIDES, INC.
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Patent number: 11494434
    Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata the indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities, which may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: November 8, 2022
    Assignee: ROVI GUIDES, INC.
    Inventors: Ankur Aher, Indranil Coomar Doss, Aashish Goyal, Aman Puniyani, Kandala Reddy, Mithun Umesh
  • Patent number: 11450306
    Abstract: The system trains a model to provide information used to provide a synthesized speech response to a voice input. The model takes as input prosodic information that may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example. The system receives a plurality of voice inputs, each associated with prosodic metric, as well as a plurality of responses, each also associated with prosodic metrics. The system trains the model based on the plurality of voice inputs, the plurality of responses, the prosodic metrics of the voice inputs, and the prosodic metrics of the responses such that the model outputs information used to generate the response. The model may also take as input user profile information, emotion metrics, and transition information to generate output. The output of the training model may be used by the system to provide synthesized speech responses having relevant prosodic character to received voice inputs.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: September 20, 2022
    Assignee: ROVl GUIDES, INC.
    Inventors: Ankur Aher, Jeffry Copps Robert Jose