Patents by Inventor Ambrish Tyagi

Ambrish Tyagi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11810597
    Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.
    Type: Grant
    Filed: October 4, 2021
    Date of Patent: November 7, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
  • Patent number: 11631260
    Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
    Type: Grant
    Filed: December 23, 2020
    Date of Patent: April 18, 2023
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Shashank Tripathi, Visesh Chari, Ambrish Tyagi, Amit Kumar Agrawal, James Rehg, Siddhartha Chandra
  • Patent number: 11526697
    Abstract: Devices and techniques are generally described for estimating three-dimensional pose data. In some examples, a first machine learning network may generate first three-dimensional (3D) data representing input 2D data. In various examples, a first 2D projection of the first 3D data may be generated. A determination may be made that the first 2D projection conforms to a distribution of natural 2D data. A second machine learning network may generate parameters of a 3D model based at least in part on the input 2D data and based at least in part on the first 3D data. In some examples, second 3D data may be generated using the parameters of the 3D model.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: December 13, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Shashank Tripathi, Ambrish Tyagi, Amit Kumar Agrawal, Siddhant Ranade
  • Patent number: 11450008
    Abstract: Devices and techniques are generally described for weakly-supervised object segmentation in image data. In various examples, a first frame of image data may be received. The first frame may include a first bounding box surrounding a first set of pixels, wherein first subset of pixels of the first set of pixels represent a first object of a first class and wherein second subset of pixels of the first set of pixels represent background image data. Cross-entropy loss may be determined for the first set of pixels. In some examples, a spatial attention map may be determined for the first set of pixels. In further examples, parameters of a convolutional neural network may be determined by modulating the cross-entropy loss for the first set of pixels using the spatial attention map. The convolutional neural network may be used to generate a segmentation map.
    Type: Grant
    Filed: February 27, 2020
    Date of Patent: September 20, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ambrish Tyagi, Siddhartha Chandra, Amit Kumar Agrawal, Viveka Kulharia
  • Publication number: 20220122639
    Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.
    Type: Application
    Filed: October 4, 2021
    Publication date: April 21, 2022
    Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
  • Patent number: 11158344
    Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.
    Type: Grant
    Filed: September 30, 2015
    Date of Patent: October 26, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
  • Patent number: 10909349
    Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: February 2, 2021
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Shashank Tripathi, Visesh Chari, Ambrish Tyagi, Amit Kumar Agrawal, James Rehg, Siddhartha Chandra
  • Patent number: 10860836
    Abstract: Techniques are generally described for object detection in image data. First image data comprising a first plurality of pixel values representing an object and a second plurality of pixel values representing a background may be received. First foreground image data and first background image data may be generated from the first image data. A first feature vector representing the first plurality of pixel values may be generated. A second feature vector representing a first plurality of pixel values of second background image data may be generated. A first machine learning model may determine a first operation to perform on the first foreground image data. A transformed representation of the first foreground image data may be generated by performing the first operation on the first foreground image data. Composite image data may be generated by compositing the transformed representation of the first foreground image data with the second background image data.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: December 8, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ambrish Tyagi, Amit Kumar Agrawal, Siddhartha Chandra, Visesh Uday Kumar Chari, Shashank Tripathi, James Rehg
  • Patent number: 10582149
    Abstract: A system and method for generating preview data from video data and using the preview data to select portions of the video data or determine an order with which to upload the video data. The system may sample video data to generate sampled video data and may identify portions of the sampled video data having complexity metrics exceeding a threshold. The system may upload a first portion of the video data corresponding to the identified portions while omitting a second portion of the video data. The system may determine an order with which to upload portions of the video data based on a complexity of the video data. Therefore, portions of the video data that may require additional processing after being uploaded may be prioritized and uploaded first. As a result, a latency between the video data being uploaded and a video summarization being received is reduced.
    Type: Grant
    Filed: February 17, 2017
    Date of Patent: March 3, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Rohith Mysore Vijaya Kumar, Ambrish Tyagi, Yadunandana Nagaraja Rao, Suresh Bholabhai Lakhani, Amit Kumar Agrawal
  • Patent number: 10554850
    Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: February 4, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
  • Patent number: 10482925
    Abstract: A system and method for selecting portions of video data from preview video data is provided. The system may extract image features from the preview video data and discard video frames associated with poor image quality based on the image features. The system may determine similarity scores between individual video frames and corresponding transition costs and may identify transition points in the preview video data based on the similarity scores and/or transition costs. The system may select portions of the video data for further processing based on the transition points and the image features. By selecting portions of the video data, the system may reduce a bandwidth consumption, processing burden and/or latency associated with uploading the video data or performing further processing.
    Type: Grant
    Filed: October 13, 2017
    Date of Patent: November 19, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Ambrish Tyagi, Suresh Bholabhai Lakhani, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Amit Kumar Agrawal
  • Publication number: 20190273837
    Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.
    Type: Application
    Filed: March 7, 2019
    Publication date: September 5, 2019
    Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
  • Patent number: 10230866
    Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.
    Type: Grant
    Filed: September 30, 2015
    Date of Patent: March 12, 2019
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
  • Patent number: 10085001
    Abstract: According to one aspect of the teachings herein, a method and apparatus detect mechanical misalignments in a machine vision system during run-time operation of the machine vision system, and compensate image processing based on the detected misalignments, unless the detected misalignments are excessive. Excessive misalignments may be detected by determining a worst-case error based on them. If the worst-case error exceeds a defined limit, the machine vision system transitions to a fault state. The fault state may include disrupting operation of a hazardous machine or performing one or more other fault-state operations. Among the detected misalignments are internal misalignments within individual cameras used for imaging, and relative misalignments between cameras. The method and apparatus may further perform run-time verifications of focus and transition the machine vision system to a fault state responsive to detecting insufficient focal quality.
    Type: Grant
    Filed: March 18, 2015
    Date of Patent: September 25, 2018
    Assignee: Omron Corporation
    Inventors: Takeshi Shoji, John Drinkard, Ambrish Tyagi
  • Patent number: 10027883
    Abstract: Various embodiments enable a primary user to be identified and tracked using stereo association and multiple tracking algorithms. For example, a face detection algorithm can be run on each image captured by a respective camera independently. Stereo association can be performed to match faces between cameras. If the faces are matched and a primary user is determined, a face pair is created and used as the first data point in memory for initializing object tracking. Further, features of a user's face can be extracted and the change in position of these features between images can determine what tracking method will be used for that particular frame.
    Type: Grant
    Filed: June 18, 2014
    Date of Patent: July 17, 2018
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Cheng-Hao Kuo, Jim Oommen Thomas, Tianyang Ma, Stephen Vincent Mangiat, Sisil Sanjeev Mehta, Ambrish Tyagi, Amit Kumar Agrawal, Kah Kuen Fu, Sharadh Ramaswamy
  • Patent number: 10007860
    Abstract: The techniques described herein may identify images that likely depict one or more items by comparing features of the items to features of different regions-of-interest (ROIs) of the images. For instance, some of the images may include a user, and the techniques may define multiple regions within the image corresponding to different portions of the user. The techniques may then use a trained convolutional neural network or any other type of trained classifier to determine, for each region of the image, whether the region depicts a particular item. If so, the techniques may designate the corresponding image as depicting the item and may output an indication that the image depicts the item. The techniques may perform this process for multiple images, outputting an indication of each image deemed to depict the particular item.
    Type: Grant
    Filed: December 21, 2015
    Date of Patent: June 26, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: David Allen Fotland, Ambrish Tyagi
  • Patent number: 9992412
    Abstract: A camera device having verged cameras is disclosed. A camera device may include a housing and four cameras disposed in the housing. The housing may define a horizontal plane passing through the center of the housing. Each of the four cameras may be verged at an angle defined by a longitudinal center axis of the camera and the horizontal plane. Each camera may include a vertical field of view verged at the same angle. The camera device may produce a panoramic image (e.g., a panoramic still image or panoramic video) using two or more of the cameras. Systems and processes including the camera device are also disclosed.
    Type: Grant
    Filed: April 15, 2015
    Date of Patent: June 5, 2018
    Assignee: Amazon Technologies, Inc.
    Inventor: Ambrish Tyagi
  • Patent number: 9953242
    Abstract: The techniques described herein may identify images that likely depict one or more items by comparing features of the items to features of different regions-of-interest (ROIs) of the images. When a user requests to identify images that depict a particular item, the techniques may determine a region-of-interest (ROI) size based on the size of the requested item. The techniques may then search multiple images using the ROI size.
    Type: Grant
    Filed: December 21, 2015
    Date of Patent: April 24, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Ambrish Tyagi, David Allen Fotland
  • Patent number: 9866820
    Abstract: An electronic device can have two or more pairs of cameras capable of performing three-dimensional imaging. In order to provide accurate disparity information, these cameras should be sufficiently calibrated. Automatic calibration can be performed by periodically capturing images with a pair of front-facing cameras and locating matching facial or other feature points in corresponding images captured by those cameras. Correspondences can be detected between feature points and the corresponding feature points can be normalized and outlier feature points can be rejected. A transformation matrix can be determined using at least a portion of remaining feature points and can be used to determine rotation and translation parameters to correct for misalignment between the cameras. The calibration parameters can be refined or otherwise adjusted, and can be used or stored for use in correcting images subsequently captured by those cameras.
    Type: Grant
    Filed: July 1, 2014
    Date of Patent: January 9, 2018
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Amit Kumar Agrawal, Ilya Vladimirovich Brailovskiy, Sharadh Ramaswamy, Ambrish Tyagi
  • Patent number: 9842402
    Abstract: Various examples are directed to systems and methods for detecting regions in video frames. For example, a computing device may receive a video comprising a plurality of frames and a video frame sequence of the plurality of frames. The computing device may select a plurality of scene point location from a first frame. The computing device may determine a plurality of columns in the first frame and fit a first sinusoidal function to a distribution of average column Y-axis displacements for the plurality of columns by column position. The computing device may determine a first difference based at least in part on the first scene point Y-axis displacement and an output of the first sinusoidal function at the X-axis position of the first scene point and determine that the first difference is greater than a threshold distance.
    Type: Grant
    Filed: December 21, 2015
    Date of Patent: December 12, 2017
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Rohith Mysore Vijaya Kumar, Abhishek Singh, Ambrish Tyagi