Patents by Inventor Matthai Philipose

Matthai Philipose has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11593633
    Abstract: Systems, methods, and computer-readable storage devices are disclosed for improved real-time audio processing. One method including: constructing a deep neural network model, including a plurality of at least one-bit neurons, configured to output a predicted label of audio data, the plurality of at least one-bit neurons arranged in a plurality of layers, including at least one hidden layer, and being connected by a plurality of connections, each connection having at least a one-bit weight, wherein one or both of the plurality of at least one-bit neurons and the plurality of connections have a reduced bit precision; receiving a training data set, the training data set including audio data; training the deep neural network model using the training data set; and outputting a trained deep neural network model configured to output a predicted label of real-time audio data.
    Type: Grant
    Filed: April 13, 2018
    Date of Patent: February 28, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ivan Jelev Tashev, Shuayb M Zarar, Matthai Philipose, Jong Hwan Ko
  • Patent number: 11354902
    Abstract: A method can include classifying, using a compressed and specialized convolutional neural network (CNN), an object of a video frame into classes, clustering the object based on a distance of a feature vector of the object to a feature vector of a centroid object of the cluster, storing top-k classes, a centroid identification, and a cluster identification, in response to receiving a query for objects of class X from a specific video stream, retrieving image data for each centroid of each cluster that includes the class X as one of the top-k classes, classifying, using a ground truth CNN (GT-CNN), the retrieved image data for each centroid, and for each centroid determined to be classified as a member of the class X providing image data for each object in each cluster associated with the centroid.
    Type: Grant
    Filed: May 15, 2020
    Date of Patent: June 7, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ganesh Ananthanarayanan, Paramvir Bahl, Peter Bodik, Tsuwang Hsieh, Matthai Philipose
  • Patent number: 11170819
    Abstract: Described herein is a mechanism for creating a dynamic video highlight from a plurality of video segments. A metadata collection agent collects metadata comprising attributes about a video, segments within the video where one or more events occur, attributes about the creator of the video and so forth. The metadata is collected and used to create highlight video definitions comprising a set of metadata attribute-value pairs. The highlight video definitions can be created in an interactive fashion by presenting a user interface allowing selection of a combination of attribute-value pairs to include/exclude segments from the definition and/or manual selection of custom segments. The highlight video definitions can be stored and/or shared among users. The video highlight definitions are utilized to instantiate one or more video players to play the video segments in an identified order without assembling the video segments into a separate video.
    Type: Grant
    Filed: May 14, 2019
    Date of Patent: November 9, 2021
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Donald Frank Brinkman, Jr., Suvamsh Shivaprasad, Max Artemov, Lenin Ravindranath Sivalingam, Matthai Philipose, Peter Bodik
  • Patent number: 10863098
    Abstract: An imaging sensor system includes a control system that helps save energy, computing, and communication resources by capturing evaluation images with a gating image sensor and processing the evaluation images to determine whether there is a region of interest (ROI) present that warrants capturing a subsequent image with a gated image sensor for additional image processing. That is, the gating image sensor and preliminary processing operates as a gate for turning on the gated sensor for the additional image capturing and processing. The additional image captured by the gated sensor may include only the ROI, which is provided to an image processing algorithm for further image processing. In addition, a gating sensor may include infrared sensing to help determine whether the ROI includes thermal data indicative of human features.
    Type: Grant
    Filed: June 20, 2013
    Date of Patent: December 8, 2020
    Assignee: Microsoft Technology Licensing. LLC
    Inventors: Nissanka Arachchige Bodhi Priyantha, Matthai Philipose, Stephen Edward Hodges
  • Publication number: 20200365188
    Abstract: Described herein is a mechanism for creating a dynamic video highlight from a plurality of video segments. A metadata collection agent collects metadata comprising attributes about a video, segments within the video where one or more events occur, attributes about the creator of the video and so forth. The metadata is collected and used to create highlight video definitions comprising a set of metadata attribute-value pairs. The highlight video definitions can be created in an interactive fashion by presenting a user interface allowing selection of a combination of attribute-value pairs to include/exclude segments from the definition and/or manual selection of custom segments. The highlight video definitions can be stored and/or shared among users. The video highlight definitions are utilized to instantiate one or more video players to play the video segments in an identified order without assembling the video segments into a separate video.
    Type: Application
    Filed: May 14, 2019
    Publication date: November 19, 2020
    Inventors: Donald Frank Brinkman, JR., Suvamsh Shivaprasad, Max Artemov, Lenin Ravindranath Sivalingam, Matthai Philipose, Peter Bodik
  • Publication number: 20200334465
    Abstract: A method can include classifying, using a compressed and specialized convolutional neural network (CNN), an object of a video frame into classes, clustering the object based on a distance of a feature vector of the object to a feature vector of a centroid object of the cluster, storing top-k classes, a centroid identification, and a cluster identification, in response to receiving a query for objects of class X from a specific video stream, retrieving image data for each centroid of each cluster that includes the class X as one of the top-k classes, classifying, using a ground truth CNN (GT-CNN), the retrieved image data for each centroid, and for each centroid determined to be classified as a member of the class X providing image data for each object in each cluster associated with the centroid.
    Type: Application
    Filed: May 15, 2020
    Publication date: October 22, 2020
    Inventors: Ganesh Ananthanarayanan, Paramvir Bahl, Peter Bodik, Tsuwang Hsieh, Matthai Philipose
  • Patent number: 10685235
    Abstract: A method can include classifying, using a compressed and specialized convolutional neural network (CNN), an object of a video frame into classes, clustering the object based on a distance of a feature vector of the object to a feature vector of a centroid object of the cluster, storing top-k classes, a centroid identification, and a cluster identification, in response to receiving a query for objects of class X from a specific video stream, retrieving image data for each centroid of each cluster that includes the class X as one of the top-k classes, classifying, using a ground truth CNN (GT-CNN), the retrieved image data for each centroid, and for each centroid determined to be classified as a member of the class X providing image data for each object in each cluster associated with the centroid.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: June 16, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ganesh Ananthanarayanan, Paramvir Bahl, Peter Bodik, Tsuwang Hsieh, Matthai Philipose
  • Patent number: 10602054
    Abstract: A method to record video with a video camera while respecting bystander privacy includes acquiring sensory data separate from the video, parsing the sensory data for evidence of a human being in a field of view of the video camera, and recording video with the video camera if no human being is detected in the field of view, based upon the sensory data.
    Type: Grant
    Filed: January 9, 2015
    Date of Patent: March 24, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Jaeyeon Jung, Matthai Philipose
  • Patent number: 10579910
    Abstract: A classification system classifies different aspects of content of an input image stream, such as faces, landmarks, events, and so forth. The classification system includes a general classifier and at least one specialized classifier template. The general classifier is trained to classify a large number of different aspects of content, and a specialized classifier can be trained based on a specialized classifier template during operation of the classification system to classify a particular subset of the multiple different aspects of content. The classification system determines when to use the general classifier and when to use a specialized classifier based on class skew, which refers to the temporal locality of a subset of aspects of content in the image stream.
    Type: Grant
    Filed: April 20, 2018
    Date of Patent: March 3, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Matthai Philipose, Haichen Shen, Alastair Wolman, Sharad Agarwal
  • Publication number: 20190318237
    Abstract: Systems, methods, and computer-readable storage devices are disclosed for improved real-time audio processing. One method including: constructing a deep neural network model, including a plurality of at least one-bit neurons, configured to output a predicted label of audio data, the plurality of at least one-bit neurons arranged in a plurality of layers, including at least one hidden layer, and being connected by a plurality of connections, each connection having at least a one-bit weight, wherein one or both of the plurality of at least one-bit neurons and the plurality of connections have a reduced bit precision; receiving a training data set, the training data set including audio data; training the deep neural network model using the training data set; and outputting a trained deep neural network model configured to output a predicted label of real-time audio data.
    Type: Application
    Filed: April 13, 2018
    Publication date: October 17, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Ivan Jelev TASHEV, Shuayb M ZARAR, Matthai PHILIPOSE, Jong HWAN KO
  • Publication number: 20190230432
    Abstract: Wearable audio accessories for computing devices are described. In one embodiment the wearable audio accessory provides a speech based interface between the user and a nearby computing device for the performance of user-initiated or computing device initiated microtasks. Information is provided to the user via a loudspeaker and the user can provide input via a microphone. An audio sensing channel within the accessory continuously monitors the audio signal as detected by the microphone and in various embodiments will trigger more complex audio processing based on this monitoring. A wireless communication link is provided between the accessory and the nearby computing device. To mitigate any delay caused by the switching between audio processing techniques, the audio accessory may include a rolling buffer which continuously stores the audio signal and outputs a delayed audio signal to the audio processing engines.
    Type: Application
    Filed: March 29, 2019
    Publication date: July 25, 2019
    Inventors: Stephen E. HODGES, John HELMES, Matthai PHILIPOSE, Nissanka Arachchige Bodhi PRIYANTHA
  • Publication number: 20190205649
    Abstract: A method can include classifying, using a compressed and specialized convolutional neural network (CNN), an object of a video frame into classes, clustering the object based on a distance of a feature vector of the object to a feature vector of a centroid object of the cluster, storing top-k classes, a centroid identification, and a cluster identification, in response to receiving a query for objects of class X from a specific video stream, retrieving image data for each centroid of each cluster that includes the class X as one of the top-k classes, classifying, using a ground truth CNN (GT-CNN), the retrieved image data for each centroid, and for each centroid determined to be classified as a member of the class X providing image data for each object in each cluster associated with the centroid.
    Type: Application
    Filed: May 4, 2018
    Publication date: July 4, 2019
    Inventors: Ganesh Ananthanarayanan, Paramvir Bahl, Peter Bodik, Tsuwang Hsieh, Matthai Philipose
  • Patent number: 10264346
    Abstract: Wearable audio accessories for computing devices are described. In one embodiment the wearable audio accessory provides a speech based interface between the user and a nearby computing device for the performance of user-initiated or computing device initiated microtasks. Information is provided to the user via a loudspeaker and the user can provide input via a microphone. An audio sensing channel within the accessory continuously monitors the audio signal as detected by the microphone and in various embodiments will trigger more complex audio processing based on this monitoring. A wireless communication link is provided between the accessory and the nearby computing device. To mitigate any delay caused by the switching between audio processing techniques, the audio accessory may include a rolling buffer which continuously stores the audio signal and outputs a delayed audio signal to the audio processing engines.
    Type: Grant
    Filed: October 12, 2017
    Date of Patent: April 16, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Stephen E. Hodges, John Helmes, Matthai Philipose, Nissanka Arachchige Bodhi Priyantha
  • Patent number: 10225582
    Abstract: A global manager communicates with various local managers to receive and process video queries. The video queries identify components that process live video streams, placement options for where the components of the video query may be executed, and various video query plans. The video query plans include options such as framerate and video quality. As the global manager processes the video queries, the global manager determines an initial set of video query configurations that identify a video query plan and placement option for each component of a given video query. Using the initial set of video query configurations, the global manager then determines an optimal set of video query configurations for the received set of video queries. The global manager communications instructions to the local managers to execute the components of the video queries using the video query plans and placement options from the optimal set of video query configurations.
    Type: Grant
    Filed: December 15, 2017
    Date of Patent: March 5, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ganesh Ananthanarayanan, Matthai Philipose, Peter Bodik, Chien-Chun Hung, Paramvir Bahl
  • Patent number: 10223604
    Abstract: Various technologies described herein pertain to performing video analytics. The approaches set forth herein support live video analytics at scale with approximate and delay-tolerant processing. Video streams can be captured by multiple cameras and continuously streamed to a video analytics computing system; the video streams can be received at the video analytics computing system. Multiple video analytics queries can be executed on the video streams. The multiple video analytics queries can be concurrently executed by the video analytics computing system on the video streams as the video streams are continuously streamed to the video analytics computing system. The multiple video analytics queries can be executed utilizing resources of the video analytics computing system allocated between the multiple video analytics queries. Execution of the multiple video analytics queries can return respective results for the multiple video analytics queries.
    Type: Grant
    Filed: December 8, 2016
    Date of Patent: March 5, 2019
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Paramvir Bahl, Peter Bodik, Matthai Philipose, Ganesh Ananthanarayanan, Haoyu Zhang
  • Publication number: 20190068996
    Abstract: A global manager communicates with various local managers to receive and process video queries. The video queries identify components that process live video streams, placement options for where the components of the video query may be executed, and various video query plans. The video query plans include options such as framerate and video quality. As the global manager processes the video queries, the global manager determines an initial set of video query configurations that identify a video query plan and placement option for each component of a given video query. Using the initial set of video query configurations, the global manager then determines an optimal set of video query configurations for the received set of video queries. The global manager communications instructions to the local managers to execute the components of the video queries using the video query plans and placement options from the optimal set of video query configurations.
    Type: Application
    Filed: December 15, 2017
    Publication date: February 28, 2019
    Inventors: Ganesh Ananthanarayanan, Matthai Philipose, Peter Bodik, Chien-Chun Hung, Paramvir Bahl
  • Publication number: 20180239990
    Abstract: A classification system classifies different aspects of content of an input image stream, such as faces, landmarks, events, and so forth. The classification system includes a general classifier and at least one specialized classifier template. The general classifier is trained to classify a large number of different aspects of content, and a specialized classifier can be trained based on a specialized classifier template during operation of the classification system to classify a particular subset of the multiple different aspects of content. The classification system determines when to use the general classifier and when to use a specialized classifier based on class skew, which refers to the temporal locality of a subset of aspects of content in the image stream.
    Type: Application
    Filed: April 20, 2018
    Publication date: August 23, 2018
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Matthai Philipose, Haichen Shen, Alastair Wolman, Sharad Agarwal
  • Publication number: 20180199131
    Abstract: Wearable audio accessories for computing devices are described. In one embodiment the wearable audio accessory provides a speech based interface between the user and a nearby computing device for the performance of user-initiated or computing device initiated microtasks. Information is provided to the user via a loudspeaker and the user can provide input via a microphone. An audio sensing channel within the accessory continuously monitors the audio signal as detected by the microphone and in various embodiments will trigger more complex audio processing based on this monitoring. A wireless communication link is provided between the accessory and the nearby computing device. To mitigate any delay caused by the switching between audio processing techniques, the audio accessory may include a rolling buffer which continuously stores the audio signal and outputs a delayed audio signal to the audio processing engines.
    Type: Application
    Filed: October 12, 2017
    Publication date: July 12, 2018
    Inventors: Stephen E. HODGES, John HELMES, Matthai PHILIPOSE, Nissanka Arachchige Bodhi Priyantha
  • Patent number: 9984314
    Abstract: A classification system classifies different aspects of content of an input image stream, such as faces, landmarks, events, and so forth. The classification system includes a general classifier and at least one specialized classifier template. The general classifier is trained to classify a large number of different aspects of content, and a specialized classifier can be trained based on a specialized classifier template during operation of the classification system to classify a particular subset of the multiple different aspects of content. The classification system determines when to use the general classifier and when to use a specialized classifier based on class skew, which refers to the temporal locality of a subset of aspects of content in the image stream.
    Type: Grant
    Filed: May 6, 2016
    Date of Patent: May 29, 2018
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Matthai Philipose, Haichen Shen, Alastair Wolman, Sharad Agarwal
  • Publication number: 20180129892
    Abstract: Various technologies described herein pertain to performing video analytics. The approaches set forth herein support live video analytics at scale with approximate and delay-tolerant processing. Video streams can be captured by multiple cameras and continuously streamed to a video analytics computing system; the video streams can be received at the video analytics computing system. Multiple video analytics queries can be executed on the video streams. The multiple video analytics queries can be concurrently executed by the video analytics computing system on the video streams as the video streams are continuously streamed to the video analytics computing system. The multiple video analytics queries can be executed utilizing resources of the video analytics computing system allocated between the multiple video analytics queries. Execution of the multiple video analytics queries can return respective results for the multiple video analytics queries.
    Type: Application
    Filed: December 8, 2016
    Publication date: May 10, 2018
    Inventors: Paramvir Bahl, Peter Bodik, Matthai Philipose, Ganesh Ananthanarayanan, Haoyu Zhang