Patents by Inventor Rohit Prasad

Rohit Prasad has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9418283
    Abstract: A system to recognize text, objects, or symbols in a captured image using machine learning models reduces computational overhead by generating a plurality of thumbnail versions of the image at different downscaled resolutions and aspect ratios, and then processing the downscaled images instead of the entire image, or sections of the entire image. The downscaled images are processed to produce a combined feature vector characterizing the overall image. The combined feature vector is processed using the machine learning model.
    Type: Grant
    Filed: August 20, 2014
    Date of Patent: August 16, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Pradeep Natarajan, Avnish Sikka, Rohit Prasad
  • Patent number: 9368105
    Abstract: Natural language controlled devices may be configured to activate command recognition in response to one or more wake words. Techniques are provided to allow for multiple operating modes in which different recognition parameters are employed in recognizing wake words that activate the natural language control functionality of a computing device.
    Type: Grant
    Filed: June 26, 2014
    Date of Patent: June 14, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Ian W. Freed, William Folwell Barton, Rohit Prasad
  • Patent number: 9330332
    Abstract: An approach to computation of kernel descriptors is accelerated using precomputed tables. In one aspect, a fast algorithm for kernel descriptor computation takes O(1) operations per pixel in each patch, based on precomputed kernel values. This speeds up the kernel descriptor features under consideration, to levels that are comparable with D-SIFT and color SIFT, and two orders of magnitude faster than STIP and HoG3D. In some examples, kernel descriptors are applied to extract gradient, flow and texture based features for video analysis. In tests of the approach on a large database of internet videos used in the TRECVID MED 2011 evaluations, the flow based kernel descriptors are up to two orders of magnitude faster than STIP and HoG3D, and also produce significant performance improvements. Further, using features from multiple color planes produces small but consistent gains.
    Type: Grant
    Filed: October 4, 2013
    Date of Patent: May 3, 2016
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Pradeep Natarajan, Shuang Wu, Rohit Prasad, Premkumar Natarajan
  • Patent number: 9224207
    Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.
    Type: Grant
    Filed: September 17, 2013
    Date of Patent: December 29, 2015
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
  • Patent number: 9224061
    Abstract: A system estimates text orientation in images captured using a handheld camera prior to detecting text in the image. Text orientation is estimated based on edges detected within the image, and the image is rotated based on the estimated orientation. Text detection and processing is then performed on the rotated image. Non-text features along a periphery of the image may be sampled to assure that clutter will not undermine the estimation of orientation.
    Type: Grant
    Filed: August 20, 2014
    Date of Patent: December 29, 2015
    Assignee: Amazon Technologies, Inc.
    Inventors: Pradeep Natarajan, Avnish Sikka, Rohit Prasad
  • Patent number: 8953892
    Abstract: A computationally efficient approach to determining inner products between feature vectors is provided that eliminates or reduces the need for multiplication, and more specifically, provides an efficient and accurate basis selection for techniques such as Orthogonal Matching Pursuit.
    Type: Grant
    Filed: November 5, 2012
    Date of Patent: February 10, 2015
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
  • Patent number: 8861872
    Abstract: Distributional information for a set of α vectors is determined using a sparse basis selection approach to representing an input image or video. In some examples, this distributional information is used for a classification task.
    Type: Grant
    Filed: November 5, 2012
    Date of Patent: October 14, 2014
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
  • Publication number: 20140297252
    Abstract: A two-way speech-to-speech (S2S) translation system actively detects a wide variety of common error types and resolves them through user-friendly dialog with the user(s). Example features include one or more of detecting out-of-vocabulary (OOV) named entities and terms, sensing ambiguities, homophones, idioms, ill-formed input, etc., and interactive strategies for recovering from such errors. In some examples, different error types are prioritized and systems implementing the approach can include an extensible architecture for implementing these decisions.
    Type: Application
    Filed: December 6, 2013
    Publication date: October 2, 2014
    Inventors: Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Matthew Roy, Frederick Choi
  • Publication number: 20140126824
    Abstract: A computationally efficient approach to determining inner products between feature vectors is provided that eliminates or reduces the need for multiplication, and more specifically, provides an efficient and accurate basis selection for techniques such as Orthogonal Matching Pursuit.
    Type: Application
    Filed: November 5, 2012
    Publication date: May 8, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
  • Publication number: 20140126817
    Abstract: Distributional information for a set of α vectors is determined using a sparse basis selection approach to representing an input image or video. In some examples, this distributional information is used for a classification task.
    Type: Application
    Filed: November 5, 2012
    Publication date: May 8, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Pradeep Natarajan, Rohit Prasad, Premkumar Natarajan
  • Publication number: 20140099033
    Abstract: An approach to computation of kernel descriptors is accelerated using precomputed tables. In one aspect, a fast algorithm for kernel descriptor computation takes O(1) operations per pixel in each patch, based on precomputed kernel values. This speeds up the kernel descriptor features under consideration, to levels that are comparable with D-SIFT and color SIFT, and two orders of magnitude faster than STIP and HoG3D. In some examples, kernel descriptors are applied to extract gradient, flow and texture based features for video analysis. In tests of the approach on a large database of internet videos used in the TRECVID MED 2011 evaluations, the flow based kernel descriptors are up to two orders of magnitude faster than STIP and HoG3D, and also produce significant performance improvements. Further, using features from multiple color planes produces small but consistent gains.
    Type: Application
    Filed: October 4, 2013
    Publication date: April 10, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventors: Pradeep Natarajan, Shuang Wu, Rohit Prasad, Premkumar Natarajan
  • Publication number: 20140079316
    Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.
    Type: Application
    Filed: September 17, 2013
    Publication date: March 20, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
  • Patent number: 8644611
    Abstract: A method for text recognition includes generating a number of text hypotheses for an image, for example, using an HMM based approach using fixed-width analysis features. For each text hypothesis, one or more segmentations are generated and scored at the segmental level, for example, according to character or character group segments of the text hypothesis. In some embodiments, multiple alternative segmentations are considered for each text hypothesis. In some examples, scores determined in generating the text hypothesis and the segmental score are combined to select an overall text recognition of the image.
    Type: Grant
    Filed: June 3, 2009
    Date of Patent: February 4, 2014
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Premkumar Natarajan, Rohit Prasad, Richard Schwartz, Krishnakumar Subramanian
  • Patent number: 8290273
    Abstract: Multi-frame persistence of videotext is exploited to mitigate challenges posed by varying characteristics of videotext across frame instances to improve OCR techniques. In some examples, each frame of video is processed to form multiple binary images, and one or more text hypotheses are formed from each binary image. In some examples, one or more combined images are formed from multiple frames processed to form a binary image and a corresponding text hypothesis. The text hypotheses are combined to yield an overall text recognition output.
    Type: Grant
    Filed: March 27, 2009
    Date of Patent: October 16, 2012
    Assignee: Raytheon BBN Technologies Corp.
    Inventors: Rohit Prasad, Premkumar Natarajan, Ehry MacRostie
  • Publication number: 20110313903
    Abstract: A system and method facilitating an end-to-end solution for one or more service offerings. The end-to-end solution includes the modeling, provisioning, and rating of a single subscription as at least one of prepaid and postpaid. The system includes a CRM layer, an integration layer, a rating and billing management layer, and a service activation platform layer.
    Type: Application
    Filed: August 12, 2010
    Publication date: December 22, 2011
    Applicant: Infosys Technologies Limited
    Inventors: Gnanapriya C., Arup Goswami, Nishi Mathur, Munish Kashyap, Jamsheed Kormath, Rohit Prasad, Yuvraj Sakharam Magdum, Sreekumar Gopalakrishnan, Anand Subramanian, Soumen Saha
  • Publication number: 20100310172
    Abstract: A method for text recognition includes generating a number of text hypotheses for an image, for example, using an HMM based approach using fixed-width analysis features. For each text hypothesis, one or more segmentations are generated and scored at the segmental level, for example, according to character or character group segments of the text hypothesis. In some embodiments, multiple alternative segmentations are considered for each text hypothesis. In some examples, scores determined in generating the text hypothesis and the segmental score are combined to select an overall text recognition of the image.
    Type: Application
    Filed: June 3, 2009
    Publication date: December 9, 2010
    Applicant: BBN Technologies Corp.
    Inventors: Premkumar Natarajan, Rohit Prasad, Richard Schwartz, Krishnakumar Subramanian
  • Publication number: 20100246961
    Abstract: Multi-frame persistence of videotext is exploited to mitigate challenges posed by varying characteristics of videotext across frame instances to improve OCR techniques. In some examples, each frame of video is processed to form multiple binary images, and one or more text hypotheses are formed from each binary image. In some examples, one or more combined images are formed from multiple frames processed to form a binary image and a corresponding text hypothesis. The text hypotheses are combined to yield an overall text recognition output.
    Type: Application
    Filed: March 27, 2009
    Publication date: September 30, 2010
    Applicant: BBN Technologies Corp.
    Inventors: Rohit Prasad, Premkumar Natarajan, Ehry MacRostie
  • Patent number: 7346507
    Abstract: A method and apparatus for building a training set for an automated speech recognition-based system, which determines the statistically optimal number of frequently requested responses to automate in order to achieve a desired automation rate. The invention may be used to select the appropriate tokens and responses to train the system and to achieve a desired “phrase coverage” for all of the many different ways human beings may phrase a request that calls for one of a plurality of frequently-requested responses. The invention also determines the statistically optimal number of tokens (spoken requests) required to train a speech recognition-based system to achieve the desired phrase coverage and optimal allocation of tokens over the set of responses that are to be automated.
    Type: Grant
    Filed: June 4, 2003
    Date of Patent: March 18, 2008
    Assignee: BBN Technologies Corp.
    Inventors: Premkumar Natarajan, Rohit Prasad
  • Publication number: 20060136472
    Abstract: A method and system for supporting a concurrent recordation of a change in a data file by a server while allowing an application to continue writing changes to a data file. In response to a change in a data file, a near-instantaneous version of the file is created. Metadata reflecting the change to the data file are synchronized with a version of the file in cache and recorded in persistent storage. During the process of recording metadata changes to the file, subsequent changes to the data file may continue, and metadata reflecting the changes may be recorded in a subsequent near-instantaneous version of the file which may also be synchronized with a version of the metadata in persistent storage.
    Type: Application
    Filed: December 20, 2004
    Publication date: June 22, 2006
    Inventors: Venkateswararao Jujjuri, Malahal Naineni, Rohit Prasad, Senthil Rajaram, Roger Raphael