Patents by Inventor Ankur Aher

Ankur Aher has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for generating synthesized speech responses to voice inputs by training a neural network model based on the voice input prosodic metrics and training voice inputs

Patent number: 11443731

Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.

Type: Grant

Filed: May 13, 2020

Date of Patent: September 13, 2022

Assignee: ROVI GUIDES, INC.

Inventors: Ankur Aher, Jeffry Copps Robert Jose
SYSTEMS AND METHODS FOR DYNAMICALLY ADJUSTING QUALITY LEVELS FOR TRANSMITTING CONTENT BASED ON CONTEXT

Publication number: 20220264170

Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.

Type: Application

Filed: May 9, 2022

Publication date: August 18, 2022

Inventors: Ankur Aher, Charishma Chundi
SYSTEMS AND METHODS FOR PROVIDING UNINTERRUPTED MEDIA CONTENT DURING VEHICLE NAVIGATION

Publication number: 20220252412

Abstract: Systems and methods are disclosed herein for selecting alternate routes with fewer directions during vehicle navigation. The disclosed techniques herein determine directions from route data and direction timestamps for each of the directions. For each direction, a corresponding media asset from media assets in a playlist having a media asset duration that matches a direction duration is determined. The direction duration is the time difference between the direction timestamp and a subsequent direction timestamp.

Type: Application

Filed: April 28, 2022

Publication date: August 11, 2022

Inventors: Nishchit Mahajan, Ankur Aher
Systems and methods for managing voice queries using pronunciation information

Patent number: 11410656

Abstract: The system identifies one or more entities or content items among a plurality of stored information. The system generates an audio file based on a first text string that represents the entity or content item. Based on the first text string and at least one speech criterion, the system generating, using a speech-to-text module a second text string based on the audio file. The system then compares the text strings and stores the second text string if it is not identical to the first text string. The system generates metadata that includes results from text-speech-text conversions to forecast possible misidentifications when responding to voice queries during search operations. The metadata includes alternative representations of the entity.

Type: Grant

Filed: July 31, 2019

Date of Patent: August 9, 2022

Assignee: ROVI GUIDES, INC.

Inventors: Ankur Aher, Indranil Coomar Doss, Aashish Goyal, Aman Puniyani, Kandala Reddy, Mithun Umesh
Systems and methods for providing uninterrupted media content during vehicle navigation

Patent number: 11402231

Abstract: Systems and methods are disclosed herein for providing uninterrupted media content by reordering playlists during vehicle navigation. The disclosed techniques herein determine directions from route data and navigation announcements for each of the directions. For each direction, a corresponding media asset from media assets in a playlist having a media asset duration that matches a direction duration is determined. The direction duration is the time difference between the navigation announcement and a subsequent navigation announcement.

Type: Grant

Filed: August 30, 2019

Date of Patent: August 2, 2022

Assignee: Rovi Guides, Inc.

Inventors: Nishchit Mahajan, Ankur Aher
Systems and methods for dynamically adjusting quality levels for transmitting content based on context

Patent number: 11356725

Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.

Type: Grant

Filed: October 16, 2020

Date of Patent: June 7, 2022

Assignee: Rovi Guides, Inc.

Inventors: Ankur Aher, Charishma Chundi
Systems and methods for providing uninterrupted media content during vehicle navigation

Patent number: 11340085

Abstract: Systems and methods are disclosed herein for selecting alternate routes with fewer directions during vehicle navigation. The disclosed techniques herein determine directions from route data and direction timestamps for each of the directions. For each direction, a corresponding media asset from media assets in a playlist having a media asset duration that matches a direction duration is determined. The direction duration is the time difference between the direction timestamp and a subsequent direction timestamp.

Type: Grant

Filed: August 30, 2019

Date of Patent: May 24, 2022

Assignee: Rovi Guides, Inc.

Inventors: Nishchit Mahajan, Ankur Aher
SYSTEMS AND METHODS FOR DYNAMICALLY ADJUSTING QUALITY LEVELS FOR TRANSMITTING CONTENT BASED ON CONTEXT

Publication number: 20220124397

Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.

Type: Application

Filed: October 16, 2020

Publication date: April 21, 2022

Inventors: Ankur Aher, Charishma Chundi
SYSTEM AND METHODS TO HANDLE CONDITIONAL REQUESTS FOR LIVE PROGRAMS

Publication number: 20220114339

Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user's conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user's request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user's request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.

Type: Application

Filed: October 9, 2020

Publication date: April 14, 2022

Inventors: Ankur Aher, Susanto Sen
Systems and methods for providing uninterrupted media content during vehicle navigation

Patent number: 11248927

Abstract: Systems and methods are disclosed herein for providing uninterrupted media content during vehicle navigation. The disclosed techniques herein discuss determining directions from route data and navigation announcements for each of the directions. For each navigation announcement, a determination is made whether current playback of a media asset in a playlist ends within a predefined time threshold before the navigation announcement. In a positive determination, the playback of the playlist is paused until the navigation announcement has elapsed.

Type: Grant

Filed: August 30, 2019

Date of Patent: February 15, 2022

Assignee: Rovi Guides, Inc.

Inventors: Nishchit Mahajan, Ankur Aher
Systems and methods for disambiguating a voice search query based on gestures

Patent number: 11227593

Abstract: Systems and methods are described herein for disambiguating a voice search query by determining whether the user made a gesture while speaking a quotation from a content item and whether the user mimicked or approximated a gesture made by a character in the content item when the character spoke the words quoted by the user. If so, a search result comprising an identifier of the content item is generated. A search result representing the content item from which the quotation comes may be ranked highest among other search results returned and therefore presented first in a list of search results. If the user did not mimic or approximate a gesture made by a character in the content item when the quotation is spoken in the content item, then a search result may not be generated for the content item or may be ranked lowest among other search results.

Type: Grant

Filed: June 28, 2019

Date of Patent: January 18, 2022

Assignee: ROVI GUIDES, INC.

Inventors: Ankur Aher, Nishchit Mahajan, Narendra Purushothama, Sai Durga Venkat Reddy Pulikunta
SYSTEMS AND METHODS FOR DISPLAYING INTERACTIVE CONTENT ITEM FOR A PREDETERMINED DURATION

Publication number: 20210400360

Abstract: Systems and methods are provided for presenting an interactive content item matching a user-selected category to a user for a desired duration. A user selects a category and selects a first interactive content item on a media system. The system calculates a total duration of a storyline from the selected interactive content item that matches the selected category (e.g., a genre “comedy”) and compares the calculated duration to a desired predetermined duration for which the user wishes to watch the selected show. If the system determines, for instance, that the total duration of the selected storyline is less than the predetermined duration, the system identifies scenes from another show and interleaves them with scenes from the first interactive content item to generate a combined interactive content item that satisfies the user viewing preferences.

Type: Application

Filed: September 3, 2021

Publication date: December 23, 2021

Inventors: Ankur Aher, Sandeep Jangra, Aman Puniyani, Mohammed Yasir
Method and apparatus for generating hint words for automated speech recognition

Patent number: 11205430

Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.

Type: Grant

Filed: October 1, 2019

Date of Patent: December 21, 2021

Assignee: ROVI GUIDES, INC.

Inventors: Ankur Aher, Jeffry Copps Robert Jose
SYSTEMS AND METHODS FOR DISAMBIGUATING A VOICE SEARCH QUERY

Publication number: 20210390954

Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.

Type: Application

Filed: August 26, 2021

Publication date: December 16, 2021

Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
SYSTEMS AND METHODS FOR GENERATING SYNTHESIZED SPEECH RESPONSES TO VOICE INPUTS

Publication number: 20210319779

Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.

Type: Application

Filed: May 13, 2020

Publication date: October 14, 2021

Inventors: Ankur Aher, Jeffry Copps Robert Jose
SYSTEMS AND METHODS FOR GENERATING SYNTHESIZED SPEECH RESPONSES TO VOICE INPUTS

Publication number: 20210319780

Abstract: The system trains a model to provide information used to provide a synthesized speech response to a voice input. The model takes as input prosodic information that may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example. The system receives a plurality of voice inputs, each associated with prosodic metric, as well as a plurality of responses, each also associated with prosodic metrics. The system trains the model based on the plurality of voice inputs, the plurality of responses, the prosodic metrics of the voice inputs, and the prosodic metrics of the responses such that the model outputs information used to generate the response. The model may also take as input user profile information, emotion metrics, and transition information to generate output. The output of the training model may be used by the system to provide synthesized speech responses having relevant prosodic character to received voice inputs.

Type: Application

Filed: May 13, 2020

Publication date: October 14, 2021

Inventors: Ankur Aher, Jeffry Copps Robert Jose
Systems and methods for displaying interactive content item for a predetermined duration

Patent number: 11140463

Abstract: Systems and methods are provided for presenting an interactive content item matching a user-selected category to a user for a desired duration. A user selects a category and selects a first interactive content item on a media system. The system calculates a total duration of a storyline from the selected interactive content item that matches the selected category (e.g., a genre “comedy”) and compares the calculated duration to a desired predetermined duration for which the user wishes to watch the selected show. If the system determines, for instance, that the total duration of the selected storyline is less than the predetermined duration, the system identifies scenes from another show and interleaves them with scenes from the first interactive content item to generate a combined interactive content item that satisfies the user viewing preferences.

Type: Grant

Filed: June 28, 2019

Date of Patent: October 5, 2021

Assignee: Rovi Guides, Inc.

Inventors: Ankur Aher, Sandeep Jangra, Aman Puniyani, Mohammed Yasir
Systems and methods for disambiguating a voice search query

Patent number: 11133005

Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.

Type: Grant

Filed: April 29, 2019

Date of Patent: September 28, 2021

Assignee: Rovi Guides, Inc.

Inventors: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
SYSTEMS AND METHODS FOR PROVIDING VOICE COMMAND RECOMMENDATIONS

Publication number: 20210174795

Abstract: The system provides a voice command recommendation to a user to avoid a non-voice command. The system determines a command that is expected to be received, and generates a voice command recommendation that corresponds to the predicted command. The predicted command can be based on the user's behavior, a plurality of users' behavior, environmental circumstances such as a phone call ring, or a combination thereof. The system may access one or more databases to determine the predicted command. The voice command recommendation may include a displayed notification that describes the recommended voice command, and exemplary voice inputs that are recognized. The system also activates an audio interface, such as a microphone, that is configured to receive a voice input. If the system receives a recognizable voice input at the audio interface that corresponds to the recommendation, the system performs the predicted command in response to receiving the voice input.

Type: Application

Filed: December 10, 2019

Publication date: June 10, 2021

Inventors: Jeffry Copps Robert Jose, Ankur Aher
METHOD AND APPARATUS FOR GENERATING HINT WORDS FOR AUTOMATED SPEECH RECOGNITION

Publication number: 20210097988

Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.

Type: Application

Filed: October 1, 2019

Publication date: April 1, 2021

Inventors: Ankur Aher, Jeffry Copps Robert Jose

prev 1 2 3 next