Patents by Inventor Ruchi Bhargava

Ruchi Bhargava has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

External visual interactions for speech-based devices

Patent number: 11327711

Abstract: Examples are disclosed herein that are related to providing extended functionalities on-demand to an audio-based wearable device. One example provides a wearable computing device including an acoustic receiver configured to receive speech inputs, a speaker configured to present audio outputs, a communications subsystem configured to connect to an external device, a logic subsystem configured to execute instructions, and a storage subsystem having instructions executable by the logic subsystem to execute a program, connect to the external device via a wireless communications protocol, conduct an audio-based interaction of the program via the speech inputs received at the acoustic receiver and the audio outputs provided by the speaker, upon reaching a screen-based interaction of the program, notify a user via the speaker to interact with the external device, and provide image data to the external device for presentation via a screen of the external device.

Type: Grant

Filed: December 5, 2014

Date of Patent: May 10, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Weidong Zhao, Gary Caldwell, Ruchi Bhargava
MODEL-BASED REINFORCEMENT LEARNING FOR BEHAVIOR PREDICTION

Publication number: 20220138568

Abstract: In various examples, reinforcement learning is used to train at least one machine learning model (MLM) to control a vehicle by leveraging a deep neural network (DNN) trained on real-world data by using imitation learning to predict movements of one or more actors to define a world model. The DNN may be trained from real-world data to predict attributes of actors, such as locations and/or movements, from input attributes. The predictions may define states of the environment in a simulator, and one or more attributes of one or more actors input into the DNN may be modified or controlled by the simulator to simulate conditions that may otherwise be unfeasible. The MLM(s) may leverage predictions made by the DNN to predict one or more actions for the vehicle.

Type: Application

Filed: November 1, 2021

Publication date: May 5, 2022

Inventors: Nikolai Smolyanskiy, Alexey Kamenev, Lirui Wang, David Nister, Ollin Boer Bohan, Ishwar Kulkarni, Fangkai Yang, Julia Ng, Alperen Degirmenci, Ruchi Bhargava, Rotem Aviv
SEGMENTATION OF LIDAR RANGE IMAGES

Publication number: 20210342608

Abstract: A deep neural network(s) (DNN) may be used to detect objects from sensor data of a three dimensional (3D) environment. For example, a multi-view perception DNN may include multiple constituent DNNs or stages chained together that sequentially process different views of the 3D environment. An example DNN may include a first stage that performs class segmentation in a first view (e.g., perspective view) and a second stage that performs class segmentation and/or regresses instance geometry in a second view (e.g., top-down). The DNN outputs may be processed to generate 2D and/or 3D bounding boxes and class labels for detected objects in the 3D environment. As such, the techniques described herein may be used to detect and classify animate objects and/or parts of an environment, and these detections and classifications may be provided to an autonomous vehicle drive stack to enable safe planning and control of the autonomous vehicle.

Type: Application

Filed: July 15, 2021

Publication date: November 4, 2021

Inventors: Nikolai Smolyanskiy, Ryan Oldja, Ke Chen, Alexander Popov, Joachim Pehserl, Ibrahim Eden, Tilman Wekel, David Wehr, Ruchi Bhargava, David Nister
TOP-DOWN OBJECT DETECTION FROM LIDAR POINT CLOUDS

Publication number: 20210342609

Abstract: A deep neural network(s) (DNN) may be used to detect objects from sensor data of a three dimensional (3D) environment. For example, a multi-view perception DNN may include multiple constituent DNNs or stages chained together that sequentially process different views of the 3D environment. An example DNN may include a first stage that performs class segmentation in a first view (e.g., perspective view) and a second stage that performs class segmentation and/or regresses instance geometry in a second view (e.g., top-down). The DNN outputs may be processed to generate 2D and/or 3D bounding boxes and class labels for detected objects in the 3D environment. As such, the techniques described herein may be used to detect and classify animate objects and/or parts of an environment, and these detections and classifications may be provided to an autonomous vehicle drive stack to enable safe planning and control of the autonomous vehicle.

Type: Application

Filed: July 15, 2021

Publication date: November 4, 2021

Inventors: Nikolai Smolyanskiy, Ryan Oldja, Ke Chen, Alexander Popov, Joachim Pehserl, Ibrahim Eden, Tilman Wekel, David Wehr, Ruchi Bhargava, David Nister
FUTURE TRAJECTORY PREDICTIONS IN MULTI-ACTOR ENVIRONMENTS FOR AUTONOMOUS MACHINE APPLICATIONS

Publication number: 20210295171

Abstract: In various examples, past location information corresponding to actors in an environment and map information may be applied to a deep neural network (DNN)—such as a recurrent neural network (RNN)—trained to compute information corresponding to future trajectories of the actors. The output of the DNN may include, for each future time slice the DNN is trained to predict, a confidence map representing a confidence for each pixel that an actor is present and a vector field representing locations of actors in confidence maps for prior time slices. The vector fields may thus be used to track an object through confidence maps for each future time slice to generate a predicted future trajectory for each actor. The predicted future trajectories, in addition to tracked past trajectories, may be used to generate full trajectories for the actors that may aid an ego-vehicle in navigating the environment.

Type: Application

Filed: March 19, 2020

Publication date: September 23, 2021

Inventors: Alexey Kamenev, Nikolai Smolyanskiy, Ishwar Kulkarni, Ollin Boer Bohan, Fangkai Yang, Alperen Degirmenci, Ruchi Bhargava, Urs Muller, David Nister, Rotem Aviv
DEEP NEURAL NETWORK FOR DETECTING OBSTACLE INSTANCES USING RADAR SENSORS IN AUTONOMOUS MACHINE APPLICATIONS

Publication number: 20210156963

Abstract: In various examples, a deep neural network(s) (e.g., a convolutional neural network) may be trained to detect moving and stationary obstacles from RADAR data of a three dimensional (3D) space. In some embodiments, ground truth training data for the neural network(s) may be generated from LIDAR data. More specifically, a scene may be observed with RADAR and LIDAR sensors to collect RADAR data and LIDAR data for a particular time slice. The RADAR data may be used for input training data, and the LIDAR data associated with the same or closest time slice as the RADAR data may be annotated with ground truth labels identifying objects to be detected. The LIDAR labels may be propagated to the RADAR data, and LIDAR labels containing less than some threshold number of RADAR detections may be omitted. The (remaining) LIDAR labels may be used to generate ground truth data.

Type: Application

Filed: March 31, 2020

Publication date: May 27, 2021

Inventors: Alexander Popov, Nikolai Smolyanskiy, Ryan Oldja, Shane Murray, Tilman Wekel, David Nister, Joachim Pehserl, Ruchi Bhargava, Sangmin Oh
DEEP NEURAL NETWORK FOR DETECTING OBSTACLE INSTANCES USING RADAR SENSORS IN AUTONOMOUS MACHINE APPLICATIONS

Publication number: 20210156960

Abstract: In various examples, a deep neural network(s) (e.g., a convolutional neural network) may be trained to detect moving and stationary obstacles from RADAR data of a three dimensional (3D) space, in both highway and urban scenarios. RADAR detections may be accumulated, ego-motion-compensated, orthographically projected, and fed into a neural network(s). The neural network(s) may include a common trunk with a feature extractor and several heads that predict different outputs such as a class confidence head that predicts a confidence map and an instance regression head that predicts object instance data for detected objects. The outputs may be decoded, filtered, and/or clustered to form bounding shapes identifying the location, size, and/or orientation of detected object instances. The detected object instances may be provided to an autonomous vehicle drive stack to enable safe planning and control of the autonomous vehicle.

Type: Application

Filed: March 31, 2020

Publication date: May 27, 2021

Inventors: Alexander Popov, Nikolai Smolyanskiy, Ryan Oldja, Shane Murray, Tilman Wekel, David Nister, Joachim Pehserl, Ruchi Bhargava, Sangmin Oh
MULTI-VIEW DEEP NEURAL NETWORK FOR LIDAR PERCEPTION

Publication number: 20210150230

Abstract: A deep neural network(s) (DNN) may be used to detect objects from sensor data of a three dimensional (3D) environment. For example, a multi-view perception DNN may include multiple constituent DNNs or stages chained together that sequentially process different views of the 3D environment. An example DNN may include a first stage that performs class segmentation in a first view (e.g., perspective view) and a second stage that performs class segmentation and/or regresses instance geometry in a second view (e.g., top-down). The DNN outputs may be processed to generate 2D and/or 3D bounding boxes and class labels for detected objects in the 3D environment. As such, the techniques described herein may be used to detect and classify animate objects and/or parts of an environment, and these detections and classifications may be provided to an autonomous vehicle drive stack to enable safe planning and control of the autonomous vehicle.

Type: Application

Filed: June 29, 2020

Publication date: May 20, 2021

Inventors: Nikolai Smolyanskiy, Ryan Oldja, Ke Chen, Alexander Popov, Joachim Pehserl, Ibrahim Eden, Tilman Wekel, David Wehr, Ruchi Bhargava, David Nister
MAP CREATION AND LOCALIZATION FOR AUTONOMOUS DRIVING APPLICATIONS

Publication number: 20210063199

Abstract: An end-to-end system for data generation, map creation using the generated data, and localization to the created map is disclosed. Mapstreams—or streams of sensor data, perception outputs from deep neural networks (DNNs), and/or relative trajectory data—corresponding to any number of drives by any number of vehicles may be generated and uploaded to the cloud. The mapstreams may be used to generate map data—and ultimately a fused high definition (HD) map—that represents data generated over a plurality of drives. When localizing to the fused HD map, individual localization results may be generated based on comparisons of real-time data from a sensor modality to map data corresponding to the same sensor modality. This process may be repeated for any number of sensor modalities and the results may be fused together to determine a final fused localization result.

Type: Application

Filed: August 31, 2020

Publication date: March 4, 2021

Inventors: Amir Akbarzadeh, David Nister, Ruchi Bhargava, Birgit Henke, Ivana Stojanovic, Yu Sheng
MAP CREATION AND LOCALIZATION FOR AUTONOMOUS DRIVING APPLICATIONS

Publication number: 20210063198

Abstract: An end-to-end system for data generation, map creation using the generated data, and localization to the created map is disclosed. Mapstreams—or streams of sensor data, perception outputs from deep neural networks (DNNs), and/or relative trajectory data—corresponding to any number of drives by any number of vehicles may be generated and uploaded to the cloud. The mapstreams may be used to generate map data—and ultimately a fused high definition (HD) map—that represents data generated over a plurality of drives. When localizing to the fused HD map, individual localization results may be generated based on comparisons of real-time data from a sensor modality to map data corresponding to the same sensor modality. This process may be repeated for any number of sensor modalities and the results may be fused together to determine a final fused localization result.

Type: Application

Filed: August 31, 2020

Publication date: March 4, 2021

Inventors: David Nister, Ruchi Bhargava, Vaibhav Thukral, Michael Grabner, Ibrahim Eden, Jeffrey Liu
MAP CREATION AND LOCALIZATION FOR AUTONOMOUS DRIVING APPLICATIONS

Publication number: 20210063200

Abstract: An end-to-end system for data generation, map creation using the generated data, and localization to the created map is disclosed. Mapstreams—or streams of sensor data, perception outputs from deep neural networks (DNNs), and/or relative trajectory data—corresponding to any number of drives by any number of vehicles may be generated and uploaded to the cloud. The mapstreams may be used to generate map data—and ultimately a fused high definition (HD) map—that represents data generated over a plurality of drives. When localizing to the fused HD map, individual localization results may be generated based on comparisons of real-time data from a sensor modality to map data corresponding to the same sensor modality. This process may be repeated for any number of sensor modalities and the results may be fused together to determine a final fused localization result.

Type: Application

Filed: August 31, 2020

Publication date: March 4, 2021

Inventors: Michael Kroepfl, Amir Akbarzadeh, Ruchi Bhargava, Vaibhav Thukral, Neda Cvijetic, Vadim Cugunovs, David Nister, Birgit Henke, Ibrahim Eden, Youding Zhu, Michael Grabner, Ivana Stojanovic, Yu Sheng, Jeffrey Liu, Enliang Zheng, Jordan Marr, Andrew Carley
DEEP NEURAL NETWORK FOR SEGMENTATION OF ROAD SCENES AND ANIMATE OBJECT INSTANCES FOR AUTONOMOUS DRIVING APPLICATIONS

Publication number: 20210026355

Abstract: A deep neural network(s) (DNN) may be used to perform panoptic segmentation by performing pixel-level class and instance segmentation of a scene using a single pass of the DNN. Generally, one or more images and/or other sensor data may be stitched together, stacked, and/or combined, and fed into a DNN that includes a common trunk and several heads that predict different outputs. The DNN may include a class confidence head that predicts a confidence map representing pixels that belong to particular classes, an instance regression head that predicts object instance data for detected objects, an instance clustering head that predicts a confidence map of pixels that belong to particular instances, and/or a depth head that predicts range values. These outputs may be decoded to identify bounding shapes, class labels, instance labels, and/or range values for detected objects, and used to enable safe path planning and control of an autonomous vehicle.

Type: Application

Filed: July 24, 2020

Publication date: January 28, 2021

Inventors: Ke Chen, Nikolai Smolyanskiy, Alexey Kamenev, Ryan Oldja, Tilman Wekel, David Nister, Joachim Pehserl, Ibrahim Eden, Sangmin Oh, Ruchi Bhargava
SYSTEMS AND METHODS FOR COMPUTER-ASSISTED SHUTTLES, BUSES, ROBO-TAXIS, RIDE-SHARING AND ON-DEMAND VEHICLES WITH SITUATIONAL AWARENESS

Publication number: 20190265703

Abstract: A system and method for an on-demand shuttle, bus, or taxi service able to operate on private and public roads provides situational awareness and confidence displays. The shuttle may include ISO 26262 Level 4 or Level 5 functionality and can vary the route dynamically on-demand, and/or follow a predefined route or virtual rail. The shuttle is able to stop at any predetermined station along the route. The system allows passengers to request rides and interact with the system via a variety of interfaces, including without limitation a mobile device, desktop computer, or kiosks. Each shuttle preferably includes an in-vehicle controller, which preferably is an AI Supercomputer designed and optimized for autonomous vehicle functionality, with computer vision, deep learning, and real time ray tracing accelerators. An AI Dispatcher performs AI simulations to optimize system performance according to operator-specified system parameters.

Type: Application

Filed: February 26, 2019

Publication date: August 29, 2019

Inventors: Gary HICOK, Michael COX, Miguel SAINZ, Martin HEMPEL, Ratin KUMAR, Timo ROMAN, Gordon GRIGOR, David NISTER, Justin EBERT, Chin SHIH, Tony TAM, Ruchi BHARGAVA
EXTERNAL VISUAL INTERACTIONS FOR SPEECH-BASED DEVICES

Publication number: 20160162259

Abstract: Examples are disclosed herein that are related to providing extended functionalities on-demand to an audio-based wearable device. One example provides a wearable computing device including an acoustic receiver configured to receive speech inputs, a speaker configured to present audio outputs, a communications subsystem configured to connect to an external device, a logic subsystem configured to execute instructions, and a storage subsystem having instructions executable by the logic subsystem to execute a program, connect to the external device via a wireless communications protocol, conduct an audio-based interaction of the program via the speech inputs received at the acoustic receiver and the audio outputs provided by the speaker, upon reaching a screen-based interaction of the program, notify a user via the speaker to interact with the external device, and provide image data to the external device for presentation via a screen of the external device.

Type: Application

Filed: December 5, 2014

Publication date: June 9, 2016

Inventors: Weidong Zhao, Gary Caldwell, Ruchi Bhargava
Eye-typing term recognition

Patent number: 9304586

Abstract: Various embodiments related to entering text into a computing device via eye-typing are disclosed. For example, one embodiment provides a method that includes receiving a data set including a plurality of gaze samples, each gaze sample including a gaze location and a corresponding point in time. The method further comprises processing the plurality of gaze samples to determine one or more likely terms represented by the data set.

Type: Grant

Filed: December 17, 2014

Date of Patent: April 5, 2016

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: David Nister, Vaibhav Thukral, Djordje Nijemcevic, Ruchi Bhargava
EYE-TYPING TERM RECOGNITION

Publication number: 20150103000

Abstract: Various embodiments related to entering text into a computing device via eye-typing are disclosed. For example, one embodiment provides a method that includes receiving a data set including a plurality of gaze samples, each gaze sample including a gaze location and a corresponding point in time. The method further comprises processing the plurality of gaze samples to determine one or more likely terms represented by the data set.

Type: Application

Filed: December 17, 2014

Publication date: April 16, 2015

Inventors: David Nister, Vaibhav Thukral, Djordje Nijemcevic, Ruchi Bhargava
Eye-typing term recognition

Patent number: 8917238

Abstract: Various embodiments related to entering text into a computing device via eye-typing are disclosed. For example, one embodiment provides a method that includes receiving a data set including a plurality of gaze samples, each gaze sample including a gaze location and a corresponding point in time. The method further comprises processing the plurality of gaze samples to determine one or more likely terms represented by the data set.

Type: Grant

Filed: June 28, 2012

Date of Patent: December 23, 2014

Assignee: Microsoft Corporation

Inventors: David Nister, Vaibhav Thukral, Djordje Nijemcevic, Ruchi Bhargava
EYE-TYPING TERM RECOGNITION

Publication number: 20140002341

Abstract: Various embodiments related to entering text into a computing device via eye-typing are disclosed. For example, one embodiment provides a method that includes receiving a data set including a plurality of gaze samples, each gaze sample including a gaze location and a corresponding point in time. The method further comprises processing the plurality of gaze samples to determine one or more likely terms represented by the data set.

Type: Application

Filed: June 28, 2012

Publication date: January 2, 2014

Inventors: David Nister, Vaibhav Thukral, Djordje Nijemcevic, Ruchi Bhargava

prev 1 2