Patents by Inventor Rohit Prasad

Rohit Prasad has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Active error detection and resolution for linguistic translation

Patent number: 9710463

Abstract: A two-way speech-to-speech (S2S) translation system actively detects a wide variety of common error types and resolves them through user-friendly dialog with the user(s). Examples include features including one or more of detecting out-of-vocabulary (OOV) named entities and terms, sensing ambiguities, homophones, idioms, ill-formed input, etc. and interactive strategies for recovering from such errors. In some examples, different error types are prioritized and systems implementing the approach can include an extensible architecture for implementing these decisions.

Type: Grant

Filed: December 6, 2013

Date of Patent: July 18, 2017

Assignee: Raytheon BBN Technologies Corp.

Inventors: Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Matthew Roy, Frederick Choi
Audio output masking for improved automatic speech recognition

Patent number: 9704478

Abstract: Features are disclosed for filtering portions of an output audio signal in order to improve automatic speech recognition on an input signal which may include a representation of the output signal. A signal that includes audio content can be received, and a frequency or band of frequencies can be selected to be filtered from the signal. The frequency band may correspond to a desired frequency band for speech recognition. An input signal can be obtained comprising audio data corresponding to a user utterance and presentation of the output signal. Automatic speech recognition can be performed on the input signal. In some cases, an acoustic model trained for use with such frequency band filtering may be used to perform speech recognition.

Type: Grant

Filed: December 2, 2013

Date of Patent: July 11, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Shiv Naga Prasad Vitaladevuni, Amit Singh Chhetri, Phillip Ryan Hilmes, Rohit Prasad
Keyword detection modeling using contextual and environmental information

Patent number: 9697828

Abstract: Features are disclosed for detecting words in audio using environmental information and/or contextual information in addition to acoustic features associated with the words to be detected. A detection model can be generated and used to determine whether a particular word, such as a keyword or “wake word,” has been uttered. The detection model can operate on features derived from an audio signal, contextual information associated with generation of the audio signal, and the like. In some embodiments, the detection model can be customized for particular users or groups of users based usage patterns associated with the users.

Type: Grant

Filed: June 20, 2014

Date of Patent: July 4, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Rohit Prasad, Kenneth John Basye, Spyridon Matsoukas, Rajiv Ramachandran, Shiv Naga Prasad Vitaladevuni, Bjorn Hoffmeister
Estimating false rejection rate in a detection system

Patent number: 9589560

Abstract: Features are disclosed for estimating a false rejection rate in a detection system. The false rejection rate can be estimated by fitting a model to a distribution of detection confidence scores. An estimated false rejection rate can then be computed for confidence scores that fall below a threshold. The false rejection rate and model can be verified once the detection system has been deployed by obtaining additional data with confidence scores falling below the threshold. Adjustments to the model or other operational parameters can be implemented based on the verified false rejection rate, model, or additional data.

Type: Grant

Filed: December 19, 2013

Date of Patent: March 7, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Shiv Naga Prasad Vitaladevuni, Bjorn Hoffmeister, Rohit Prasad
Documenting test room configurations

Patent number: 9441951

Abstract: Techniques are described for documenting the positions of items in a room, such as in rooms that are configured for testing automated systems that perform position-related functions. A non-contact measuring tool may be placed at different reference positions within the room. At each position, measurements are made to the room corners and to items of interest within the room. Based on this information, coordinates of the reference positions are calculated. Coordinates of the items are calculated based on the determined coordinates of the reference positions.

Type: Grant

Filed: November 25, 2013

Date of Patent: September 13, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Shiv Naga Prasad Vitaladevuni, Janet Louise Slifka, Rohit Prasad
Image processing using multiple aspect ratios

Patent number: 9418283

Abstract: A system to recognize text, objects, or symbols in a captured image using machine learning models reduces computational overhead by generating a plurality of thumbnail versions of the image at different downscaled resolutions and aspect ratios, and then processing the downscaled images instead of the entire image, or sections of the entire image. The downscaled images are processed to produce a combine feature vector characterizing the overall image. The combined feature vector is processed using the machine learning model.

Type: Grant

Filed: August 20, 2014

Date of Patent: August 16, 2016

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Pradeep Natarajan, Avnish Sikka, Rohit Prasad
Preventing false wake word detections with a voice-controlled device

Patent number: 9368105

Abstract: Natural language controlled devices may be configured to activate command recognition in response to one or more wake words. Techniques are provided to allow for multiple operating modes in which different recognition parameters are employed in recognizing wake words that activate the natural language control functionality of a computing device.

Type: Grant

Filed: June 26, 2014

Date of Patent: June 14, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Ian W. Freed, William Folwell Barton, Rohit Prasad
Fast computation of kernel descriptors

Patent number: 9330332

Abstract: An approach to computation of kernel descriptors is accelerated using precomputed tables. In one aspect, a fast algorithm for kernel descriptor computation that takes O(1) operations per pixel in each patch, based on pre-computed kernel values. This speeds up the kernel descriptor features under consideration, to levels that are comparable with D-SIFT and color SIFT, and two orders of magnitude faster than STIP and HoG3D. In some examples, kernel descriptors are applied to extract gradient, flow and texture based features for video analysis. In tests of the approach on a large database of internet videos used in the TRECVID MED 2011 evaluations, the flow based kernel descriptors are up to two orders of magnitude faster than STIP and HoG3D, and also produce significant performance improvements. Further, using features from multiple color planes produces small but consistent gains.

Type: Grant

Filed: October 4, 2013

Date of Patent: May 3, 2016

Assignee: Raytheon BBN Technologies Corp.

Inventors: Pradeep Natarajan, Shuang Wu, Rohit Prasad, Premkumar Natarajan
Text orientation estimation in camera captured OCR

Patent number: 9224061

Abstract: A system estimates text orientation in images captured using a handheld camera prior detecting text in the image. Text orientation is estimated based on edges detected within the image, and the image is rotated based on the estimated orientation. Text detection and processing is then performed on the rotated image. Non-text features along a periphery of the image may be sampled to assure that clutter will not undermine the estimation of orientation.

Type: Grant

Filed: August 20, 2014

Date of Patent: December 29, 2015

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Pradeep Natarajan, Avnish Sikka, Rohit Prasad
Segmentation co-clustering

Patent number: 9224207

Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.

Type: Grant

Filed: September 17, 2013

Date of Patent: December 29, 2015

Assignee: RAYTHEON BBN TECHNOLOGIES CORP.

Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
Efficient inner product computation for image and video analysis

Patent number: 8953892

Abstract: A computationally efficient approach to determining inner products between feature vectors is provided that eliminates or reduces the need for multiplication, and more specifically, provides an efficient and accurate basis selection for techniques such as Orthogonal Matching Pursuit.

Type: Grant

Filed: November 5, 2012

Date of Patent: February 10, 2015

Assignee: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
Image analysis using coefficient distributions with selective basis feature representation

Patent number: 8861872

Abstract: Distributional information for a set of ? vectors is determined using a sparse basis selection approach to representing an input image or video. In some examples, this distributional information is used for a classification task.

Type: Grant

Filed: November 5, 2012

Date of Patent: October 14, 2014

Assignee: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
ACTIVE ERROR DETECTION AND RESOLUTION FOR LINGUISTIC TRANSLATION

Publication number: 20140297252

Abstract: A two-way speech-to-speech (S2S) translation system actively detects a wide variety of common error types and resolves them through user-friendly dialog with the user(s). Examples include features including one or more of detecting out-of-vocabulary (OOV) named entities and terms, sensing ambiguities, homophones, idioms, ill-formed input, etc. and interactive strategies for recovering from such errors. In some examples, different error types are prioritized and systems implementing the approach can include an extensible architecture for implementing these decisions.

Type: Application

Filed: December 6, 2013

Publication date: October 2, 2014

Inventors: Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Matthew Roy, Frederick Choi
IMAGE ANALYSIS USING COEFFICIENT DISTRIBUTIONS WITH SELECTIVE BASIS FEATURE REPRESENTATION

Publication number: 20140126817

Abstract: Distributional information for a set of ? vectors is determined using a sparse basis selection approach to representing an input image or video. In some examples, this distributional information is used for a classification task.

Type: Application

Filed: November 5, 2012

Publication date: May 8, 2014

Applicant: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
EFFICIENT INNER PRODUCT COMPUTATION FOR IMAGE AND VIDEO ANALYSIS

Publication number: 20140126824

Abstract: A computationally efficient approach to determining inner products between feature vectors is provided that eliminates or reduces the need for multiplication, and more specifically, provides an efficient and accurate basis selection for techniques such as Orthogonal Matching Pursuit.

Type: Application

Filed: November 5, 2012

Publication date: May 8, 2014

Applicant: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
FAST COMPUTATION OF KERNEL DESCRIPTORS

Publication number: 20140099033

Abstract: An approach to computation of kernel descriptors is accelerated using precomputed tables. In one aspect, a fast algorithm for kernel descriptor computation that takes O(1) operations per pixel in each patch, based on pre-computed kernel values. This speeds up the kernel descriptor features under consideration, to levels that are comparable with D-SIFT and color SIFT, and two orders of magnitude faster than STIP and HoG3D. In some examples, kernel descriptors are applied to extract gradient, flow and texture based features for video analysis. In tests of the approach on a large database of internet videos used in the TRECVID MED 2011 evaluations, the flow based kernel descriptors are up to two orders of magnitude faster than STIP and HoG3D, and also produce significant performance improvements. Further, using features from multiple color planes produces small but consistent gains.

Type: Application

Filed: October 4, 2013

Publication date: April 10, 2014

Applicant: Raytheon BBN Technologies Corp.

Inventors: Pradeep Natarajan, Shuang Wu, Rohit Prasad, Premkumar Natarajan
SEGMENTATION CO-CLUSTERING

Publication number: 20140079316

Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.

Type: Application

Filed: September 17, 2013

Publication date: March 20, 2014

Applicant: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
Segmental rescoring in text recognition

Patent number: 8644611

Abstract: A method for text recognition includes generating a number of text hypotheses for an image, for example, using an HMM based approach using fixed-width analysis features. For each text hypothesis, one or more segmentations are generated and scored at the segmental level, for example, according to character or character group segments of the text hypothesis. In some embodiments, multiple alternative segmentations are considered for each text hypothesis. In some examples, scores determined in generating the text hypothesis and the segmental score are combined to select an overall text recognition of the image.

Type: Grant

Filed: June 3, 2009

Date of Patent: February 4, 2014

Assignee: Raytheon BBN Technologies Corp.

Inventors: Premkumar Natarajan, Rohit Prasad, Richard Schwartz, Krishnakumar Subramanian
Multi-frame videotext recognition

Patent number: 8290273

Abstract: Multi-frame persistence of videotext is exploited to mitigate challenges posed by varying characteristics of videotext across frame instances to improve OCR techniques. In some examples, each frame of video is processed to form multiple binary images, and one or more text hypotheses is formed from each binary image. In some examples, one or more combined images are formed from multiple frames processed to form a binary image and a corresponding text hypothesis. The text hypotheses are combined to yield an overall text recognition output.

Type: Grant

Filed: March 27, 2009

Date of Patent: October 16, 2012

Assignee: Raytheon BBN Technologies Corp.

Inventors: Rohit Prasad, Premkumar Natarajan, Ehry MacRostie
METHOD AND SYSTEM FACILITATING AN END-TO-END SOLUTION FOR ONE OR MORE SERVICE OFFERINGS

Publication number: 20110313903

Abstract: A system and method facilitating an end-to-end solution for one or more service offerings. The end-to-end solution includes the modeling, provisioning, and rating of a single subscription as at least one of prepaid and postpaid. The system includes a CRM layer, an integration layer, a rating and billing management layer, and a service activation platform layer.

Type: Application

Filed: August 12, 2010

Publication date: December 22, 2011

Applicant: INFOSYS TECHNOLOGIES LIMITED

Inventors: Gnanapriya C., Arup Goswami, Nishi Mathur, Munish Kashyap, Jamsheed Kormath, Rohit Prasad, Yuvraj Sakharam Magdum, Sreekumar Gopalakrishnan, Anand Subramanian, Soumen Saha

prev 1 2 3 4 5 6 next