Patents by Inventor Ambrish Tyagi
Ambrish Tyagi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11810597Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.Type: GrantFiled: October 4, 2021Date of Patent: November 7, 2023Assignee: Amazon Technologies, Inc.Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
-
Patent number: 11631260Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.Type: GrantFiled: December 23, 2020Date of Patent: April 18, 2023Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Shashank Tripathi, Visesh Chari, Ambrish Tyagi, Amit Kumar Agrawal, James Rehg, Siddhartha Chandra
-
Patent number: 11526697Abstract: Devices and techniques are generally described for estimating three-dimensional pose data. In some examples, a first machine learning network may generate first three-dimensional (3D) data representing input 2D data. In various examples, a first 2D projection of the first 3D data may be generated. A determination may be made that the first 2D projection conforms to a distribution of natural 2D data. A second machine learning network may generate parameters of a 3D model based at least in part on the input 2D data and based at least in part on the first 3D data. In some examples, second 3D data may be generated using the parameters of the 3D model.Type: GrantFiled: March 10, 2020Date of Patent: December 13, 2022Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Shashank Tripathi, Ambrish Tyagi, Amit Kumar Agrawal, Siddhant Ranade
-
Patent number: 11450008Abstract: Devices and techniques are generally described for weakly-supervised object segmentation in image data. In various examples, a first frame of image data may be received. The first frame may include a first bounding box surrounding a first set of pixels, wherein first subset of pixels of the first set of pixels represent a first object of a first class and wherein second subset of pixels of the first set of pixels represent background image data. Cross-entropy loss may be determined for the first set of pixels. In some examples, a spatial attention map may be determined for the first set of pixels. In further examples, parameters of a convolutional neural network may be determined by modulating the cross-entropy loss for the first set of pixels using the spatial attention map. The convolutional neural network may be used to generate a segmentation map.Type: GrantFiled: February 27, 2020Date of Patent: September 20, 2022Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Ambrish Tyagi, Siddhartha Chandra, Amit Kumar Agrawal, Viveka Kulharia
-
Publication number: 20220122639Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.Type: ApplicationFiled: October 4, 2021Publication date: April 21, 2022Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
-
Patent number: 11158344Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.Type: GrantFiled: September 30, 2015Date of Patent: October 26, 2021Assignee: Amazon Technologies, Inc.Inventors: Matthew Alan Townsend, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Ambrish Tyagi, Eduard Oks, Apoorv Chaudhri
-
Patent number: 10909349Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.Type: GrantFiled: June 24, 2019Date of Patent: February 2, 2021Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Shashank Tripathi, Visesh Chari, Ambrish Tyagi, Amit Kumar Agrawal, James Rehg, Siddhartha Chandra
-
Patent number: 10860836Abstract: Techniques are generally described for object detection in image data. First image data comprising a first plurality of pixel values representing an object and a second plurality of pixel values representing a background may be received. First foreground image data and first background image data may be generated from the first image data. A first feature vector representing the first plurality of pixel values may be generated. A second feature vector representing a first plurality of pixel values of second background image data may be generated. A first machine learning model may determine a first operation to perform on the first foreground image data. A transformed representation of the first foreground image data may be generated by performing the first operation on the first foreground image data. Composite image data may be generated by compositing the transformed representation of the first foreground image data with the second background image data.Type: GrantFiled: November 15, 2018Date of Patent: December 8, 2020Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Ambrish Tyagi, Amit Kumar Agrawal, Siddhartha Chandra, Visesh Uday Kumar Chari, Shashank Tripathi, James Rehg
-
Patent number: 10582149Abstract: A system and method for generating preview data from video data and using the preview data to select portions of the video data or determine an order with which to upload the video data. The system may sample video data to generate sampled video data and may identify portions of the sampled video data having complexity metrics exceeding a threshold. The system may upload a first portion of the video data corresponding to the identified portions while omitting a second portion of the video data. The system may determine an order with which to upload portions of the video data based on a complexity of the video data. Therefore, portions of the video data that may require additional processing after being uploaded may be prioritized and uploaded first. As a result, a latency between the video data being uploaded and a video summarization being received is reduced.Type: GrantFiled: February 17, 2017Date of Patent: March 3, 2020Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Rohith Mysore Vijaya Kumar, Ambrish Tyagi, Yadunandana Nagaraja Rao, Suresh Bholabhai Lakhani, Amit Kumar Agrawal
-
Patent number: 10554850Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.Type: GrantFiled: March 7, 2019Date of Patent: February 4, 2020Assignee: Amazon Technologies, Inc.Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
-
Patent number: 10482925Abstract: A system and method for selecting portions of video data from preview video data is provided. The system may extract image features from the preview video data and discard video frames associated with poor image quality based on the image features. The system may determine similarity scores between individual video frames and corresponding transition costs and may identify transition points in the preview video data based on the similarity scores and/or transition costs. The system may select portions of the video data for further processing based on the transition points and the image features. By selecting portions of the video data, the system may reduce a bandwidth consumption, processing burden and/or latency associated with uploading the video data or performing further processing.Type: GrantFiled: October 13, 2017Date of Patent: November 19, 2019Assignee: Amazon Technologies, Inc.Inventors: Ambrish Tyagi, Suresh Bholabhai Lakhani, Rohith Mysore Vijaya Kumar, Yadunandana Nagaraja Rao, Amit Kumar Agrawal
-
Publication number: 20190273837Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.Type: ApplicationFiled: March 7, 2019Publication date: September 5, 2019Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
-
Patent number: 10230866Abstract: Devices, systems and methods are disclosed for reducing a perceived latency associated with uploading and annotating video data. For example, video data may be divided into video sections that are uploaded individually so that the video sections may be annotated as they are received. This reduces a latency associated with the annotation process, as a portion of the video data is annotated before an entirety of the video data is uploaded. In addition, the annotation data may be used to generate a master clip table and extract individual video clips from the video data.Type: GrantFiled: September 30, 2015Date of Patent: March 12, 2019Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Matthew Alan Townsend, Eduard Oks, Rohith Mysore Vijaya Kumar, Apoorv Chaudhri, Yadunandana Nagaraja Rao, Ambrish Tyagi
-
Patent number: 10085001Abstract: According to one aspect of the teachings herein, a method and apparatus detect mechanical misalignments in a machine vision system during run-time operation of the machine vision system, and compensate image processing based on the detected misalignments, unless the detected misalignments are excessive. Excessive misalignments may be detected by determining a worst-case error based on them. If the worst-case error exceeds a defined limit, the machine vision system transitions to a fault state. The fault state may include disrupting operation of a hazardous machine or performing one or more other fault-state operations. Among the detected misalignments are internal misalignments within individual cameras used for imaging, and relative misalignments between cameras. The method and apparatus may further perform run-time verifications of focus and transition the machine vision system to a fault state responsive to detecting insufficient focal quality.Type: GrantFiled: March 18, 2015Date of Patent: September 25, 2018Assignee: Omron CorporationInventors: Takeshi Shoji, John Drinkard, Ambrish Tyagi
-
Patent number: 10027883Abstract: Various embodiments enable a primary user to be identified and tracked using stereo association and multiple tracking algorithms. For example, a face detection algorithm can be run on each image captured by a respective camera independently. Stereo association can be performed to match faces between cameras. If the faces are matched and a primary user is determined, a face pair is created and used as the first data point in memory for initializing object tracking. Further, features of a user's face can be extracted and the change in position of these features between images can determine what tracking method will be used for that particular frame.Type: GrantFiled: June 18, 2014Date of Patent: July 17, 2018Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Cheng-Hao Kuo, Jim Oommen Thomas, Tianyang Ma, Stephen Vincent Mangiat, Sisil Sanjeev Mehta, Ambrish Tyagi, Amit Kumar Agrawal, Kah Kuen Fu, Sharadh Ramaswamy
-
Patent number: 10007860Abstract: The techniques described herein may identify images that likely depict one or more items by comparing features of the items to features of different regions-of-interest (ROIs) of the images. For instance, some of the images may include a user, and the techniques may define multiple regions within the image corresponding to different portions of the user. The techniques may then use a trained convolutional neural network or any other type of trained classifier to determine, for each region of the image, whether the region depicts a particular item. If so, the techniques may designate the corresponding image as depicting the item and may output an indication that the image depicts the item. The techniques may perform this process for multiple images, outputting an indication of each image deemed to depict the particular item.Type: GrantFiled: December 21, 2015Date of Patent: June 26, 2018Assignee: Amazon Technologies, Inc.Inventors: David Allen Fotland, Ambrish Tyagi
-
Patent number: 9992412Abstract: A camera device having verged cameras is disclosed. A camera device may include a housing and four cameras disposed in the housing. The housing may define a horizontal plane passing through the center of the housing. Each of the four cameras may be verged at an angle defined by a longitudinal center axis of the camera and the horizontal plane. Each camera may include a vertical field of view verged at the same angle. The camera device may produce a panoramic image (e.g., a panoramic still image or panoramic video) using two or more of the cameras. Systems and processes including the camera device are also disclosed.Type: GrantFiled: April 15, 2015Date of Patent: June 5, 2018Assignee: Amazon Technologies, Inc.Inventor: Ambrish Tyagi
-
Patent number: 9953242Abstract: The techniques described herein may identify images that likely depict one or more items by comparing features of the items to features of different regions-of-interest (ROIs) of the images. When a user requests to identify images that depict a particular item, the techniques may determine a region-of-interest (ROI) size based on the size of the requested item. The techniques may then search multiple images using the ROI size.Type: GrantFiled: December 21, 2015Date of Patent: April 24, 2018Assignee: Amazon Technologies, Inc.Inventors: Ambrish Tyagi, David Allen Fotland
-
Patent number: 9866820Abstract: An electronic device can have two or more pairs of cameras capable of performing three-dimensional imaging. In order to provide accurate disparity information, these cameras should be sufficiently calibrated. Automatic calibration can be performed by periodically capturing images with a pair of front-facing cameras and locating matching facial or other feature points in corresponding images captured by those cameras. Correspondences can be detected between feature points and the corresponding feature points can be normalized and outlier feature points can be rejected. A transformation matrix can be determined using at least a portion of remaining feature points and can be used to determine rotation and translation parameters to correct for misalignment between the cameras. The calibration parameters can be refined or otherwise adjusted, and can be used or stored for use in correcting images subsequently captured by those cameras.Type: GrantFiled: July 1, 2014Date of Patent: January 9, 2018Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Amit Kumar Agrawal, Ilya Vladimirovich Brailovskiy, Sharadh Ramaswamy, Ambrish Tyagi
-
Patent number: 9842402Abstract: Various examples are directed to systems and methods for detecting regions in video frames. For example, a computing device may receive a video comprising a plurality of frames and a video frame sequence of the plurality of frames. The computing device may select a plurality of scene point location from a first frame. The computing device may determine a plurality of columns in the first frame and fit a first sinusoidal function to a distribution of average column Y-axis displacements for the plurality of columns by column position. The computing device may determine a first difference based at least in part on the first scene point Y-axis displacement and an output of the first sinusoidal function at the X-axis position of the first scene point and determine that the first difference is greater than a threshold distance.Type: GrantFiled: December 21, 2015Date of Patent: December 12, 2017Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Rohith Mysore Vijaya Kumar, Abhishek Singh, Ambrish Tyagi