Patents by Inventor Brojeshwar Bhowmick
Brojeshwar Bhowmick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Audio-speech driven animated talking face generation using a cascaded generative adversarial network
Patent number: 11551394
Abstract: Conventional state-of-the-art methods are limited in their ability to generate realistic animation from audio for unknown faces and cannot be easily generalized to different facial characteristics and voice accents. Further, these methods fail to produce realistic facial animation for subjects whose facial characteristics differ significantly from the distribution the network has seen during training. Embodiments of the present disclosure provide systems and methods that generate an audio-speech-driven animated talking face using a cascaded generative adversarial network (CGAN), wherein a first GAN is used to transfer lip motion from a canonical face to a person-specific face. A second GAN-based texture generator network is conditioned on person-specific landmarks to generate a high-fidelity face corresponding to the motion. The texture generator GAN is made more flexible using meta-learning to adapt to an unknown subject's traits and face orientation during inference.
Type: Grant
Filed: March 11, 2021
Date of Patent: January 10, 2023
Assignee: TATA CONSULTANCY SERVICES LIMITED
Inventors: Sandika Biswas, Dipanjan Das, Sanjana Sinha, Brojeshwar Bhowmick
-
Patent number: 11526174
Abstract: The disclosure herein generally relates to the field of autonomous navigation and, more particularly, to diverse trajectory proposal for autonomous navigation. The embodiment discloses a hierarchical-network-based diverse trajectory proposal for autonomous navigation. The hierarchical two-stage neural network architecture maps the perceived surroundings to diverse trajectories, in the form of trajectory waypoints, that an autonomous navigation system can choose to traverse. The first stage of the disclosed architecture is a Trajectory Proposal Network, which generates a set of diverse traversable regions in an environment that the autonomous navigation system can occupy in the future. The second stage is a Trajectory Sampling Network, which predicts fine-grained trajectory waypoints over the diverse traversable regions proposed by the Trajectory Proposal Network.
Type: Grant
Filed: June 5, 2020
Date of Patent: December 13, 2022
Assignee: TATA CONSULTANCY SERVICES LIMITED
Inventors: Brojeshwar Bhowmick, Krishnam Madhava Krishna, Sriram Nochur Narayanan, Gourav Kumar, Abhay Singh, Siva Karthik Mustikovela, Saket Saurav
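The two-stage decomposition described in this abstract can be illustrated with a toy, non-learned stand-in: a "proposal" step that enumerates diverse coarse headings and a "sampling" step that refines each into waypoints. This is a minimal sketch only; in the patent both stages are neural networks, and the fan-of-headings geometry here is an assumption for illustration.

```python
import math

def propose_regions(n_proposals):
    """Toy stand-in for the Trajectory Proposal Network: return diverse
    coarse headings (radians) fanned out in front of the vehicle."""
    spread = math.pi / 2  # 90-degree fan (illustrative choice)
    if n_proposals == 1:
        return [0.0]
    return [-spread / 2 + i * spread / (n_proposals - 1)
            for i in range(n_proposals)]

def sample_waypoints(heading, n_points=5, length=5.0):
    """Toy stand-in for the Trajectory Sampling network: refine one coarse
    heading into evenly spaced (x, y) waypoints along a straight corridor."""
    step = length / n_points
    return [(step * (i + 1) * math.cos(heading),
             step * (i + 1) * math.sin(heading)) for i in range(n_points)]

# Stage 1: diverse traversable regions; Stage 2: fine-grained waypoints.
proposals = propose_regions(3)
trajectories = [sample_waypoints(h) for h in proposals]
```

The point of the hierarchy is that diversity is enforced at the coarse stage, so the fine-grained stage only has to refine within each region rather than explore the whole space.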
-
Publication number: 20220368882
Abstract: This disclosure relates generally to a method and system for draping a 3D garment on a 3D human body. Dressing digital humans in 3D has gained much attention due to its use in online shopping, and draping 3D garments over the 3D human body has immense applications in virtual try-on and animation, where accurate fitment of the 3D garment is of utmost importance. The proposed disclosure is a single unified garment deformation model that learns the shared space of variations for body shape, body pose, and garment style. The method receives a plurality of human body inputs to construct 3D skinned garments for the subject. The deep draper network, trained using a plurality of losses, provides an efficient deep-neural-network-based method that predicts fast and accurate 3D garment images. The method couples geometric and multi-view perceptual constraints to efficiently learn the high-frequency geometry of garment deformation.
Type: Application
Filed: December 29, 2021
Publication date: November 17, 2022
Applicant: Tata Consultancy Services Limited
Inventors: LOKENDER TIWARI, BROJESHWAR BHOWMICK
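The "3D skinned garment" idea above builds on linear blend skinning, where each garment vertex is deformed by a weighted blend of body-joint transforms. The sketch below shows only that generic skinning step, not the disclosed deformation model; the vertex/weight/transform values are made up for the demo.

```python
import numpy as np

def skin_garment(verts, weights, joint_transforms):
    """Linear blend skinning: verts (V,3), per-vertex joint weights (V,J),
    joint_transforms (J,4,4) homogeneous matrices. Returns deformed (V,3)."""
    V = verts.shape[0]
    homo = np.hstack([verts, np.ones((V, 1))])                     # (V,4)
    blended = np.einsum('vj,jab->vab', weights, joint_transforms)  # (V,4,4)
    out = np.einsum('vab,vb->va', blended, homo)                   # (V,4)
    return out[:, :3]

# Demo: two vertices, each bound fully to one of two joints;
# joint 1 translates +0.5 along x, joint 0 stays at rest.
rest = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
w = np.array([[1.0, 0.0], [0.0, 1.0]])
T0, T1 = np.eye(4), np.eye(4)
T1[0, 3] = 0.5
deformed = skin_garment(rest, w, np.stack([T0, T1]))
```

A learned draping network can be seen as predicting corrective displacements on top of such a skinned base, which is where the high-frequency geometry comes from.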
-
Patent number: 11501777
Abstract: The disclosure herein relates to methods and systems for enabling human-robot interaction (HRI) to resolve task ambiguity. Conventional techniques that initiate a continuous dialogue with the human, asking a suitable question based on the observed scene until the ambiguity is resolved, are limited. The present disclosure uses the concept of Talk-to-Resolve (TTR), which initiates a continuous dialogue with the user based on visual uncertainty analysis, asking a suitable question that conveys the nature of the problem to the user and seeking guidance until all ambiguities are resolved. The suitable question is formulated based on the scene understanding and the argument spans present in the natural language instruction. The present disclosure asks questions in a natural way that not only ensures the user can understand the type of confusion the robot is facing, but also ensures minimal and relevant questioning to resolve the ambiguities.
Type: Grant
Filed: January 29, 2021
Date of Patent: November 15, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Chayan Sarkar, Pradip Pramanick, Snehasis Banerjee, Brojeshwar Bhowmick
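The dialogue pattern in this abstract can be sketched as a tiny policy: match the instruction's argument span against detected scene objects, and either act, ask a disambiguating question, or report failure. This is an illustrative toy in the spirit of TTR, not the patented question-formulation logic; the `resolve` function and the scene schema are invented for the example.

```python
def resolve(target, scene):
    """Toy dialogue policy: inspect the observed scene for the object named
    in the instruction's argument span and either act or ask a question."""
    matches = [o for o in scene if o["name"] == target]
    if not matches:
        return f"I could not find a {target}. Could you point me to it?"
    if len(matches) > 1:
        colors = " or ".join(o["color"] for o in matches)
        return f"I see {len(matches)} {target}s. Do you mean the {colors} one?"
    return f"OK, picking up the {matches[0]['color']} {target}."

scene = [{"name": "cup", "color": "red"},
         {"name": "cup", "color": "blue"},
         {"name": "book", "color": "green"}]
reply = resolve("cup", scene)
```

The question names the attribute that distinguishes the candidates, which is what lets the user understand the robot's confusion from the question alone.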
-
Patent number: 11429467
Abstract: This disclosure relates generally to a method and system for prediction of correct discrete sensor data, thus enabling a continuous flow of data even when a discrete sensor fails. The activities of humans/subjects housed in a smart environment are continuously monitored by a plurality of non-intrusive discrete sensors embedded in the living infrastructure. The collected discrete sensor data is usually sparse and largely unbalanced, with most samples being 'No' and comparatively few being 'Yes', which makes prediction very challenging. The proposed prediction technique, based on the introduction of temporal uncertainty, is performed in several stages: pre-processing of the received discrete sensor data, introduction of temporal uncertainty, and prediction based on neural network techniques that learn patterns from historical data.
Type: Grant
Filed: December 27, 2019
Date of Patent: August 30, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Avik Ghose, Brojeshwar Bhowmick
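One simple way to picture "introducing temporal uncertainty" into a sparse binary sensor stream is to spread each rare 'Yes' sample over neighbouring time slots, softening the extreme class imbalance before a model learns from it. This is an assumed interpretation for illustration only; the patent's actual technique may differ.

```python
def add_temporal_uncertainty(series, halfwidth=1):
    """Spread each positive ('Yes' = 1) sample over +/- halfwidth neighbouring
    time slots with linearly decaying weight. A simple stand-in for a
    temporal-uncertainty step on sparse, unbalanced binary sensor data."""
    out = [0.0] * len(series)
    for t, v in enumerate(series):
        if v:
            for dt in range(-halfwidth, halfwidth + 1):
                if 0 <= t + dt < len(series):
                    # keep the strongest contribution at each slot
                    out[t + dt] = max(out[t + dt],
                                      1.0 - abs(dt) / (halfwidth + 1))
    return out

smoothed = add_temporal_uncertainty([0, 0, 1, 0, 0, 0], halfwidth=1)
```

After smoothing, three slots carry signal instead of one, giving a downstream neural network more positive mass to learn from.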
-
Publication number: 20220219325
Abstract: This disclosure relates generally to navigation of a tele-robot in a dynamic environment using in-situ intelligence. Tele-robotics is the area of robotics concerned with the control of robots (tele-robots) in a remote environment from a distance. In reality, the remote environment where the tele-robot navigates may be dynamic in nature, with unpredictable movements, making navigation extremely challenging. The disclosure proposes in-situ intelligent navigation of a tele-robot in a dynamic environment. The disclosed in-situ intelligence enables the tele-robot to understand the dynamic environment by identifying objects and estimating their future locations based on generating/training a motion model. Further, the disclosed techniques also enable communication between a master and the tele-robot (whenever necessary) based on an application-layer communication semantic.
Type: Application
Filed: March 11, 2021
Publication date: July 14, 2022
Applicant: Tata Consultancy Services Limited
Inventors: Abhijan BHATTACHARYYA, Ruddra dev ROYCHOUDHURY, Sanjana SINHA, Sandika BISWAS, Ashis SAU, Madhurima GANGULY, Sayan PAUL, Brojeshwar BHOWMICK
-
Publication number: 20220148586
Abstract: The disclosure herein relates to methods and systems for enabling human-robot interaction (HRI) to resolve task ambiguity. Conventional techniques that initiate a continuous dialogue with the human, asking a suitable question based on the observed scene until the ambiguity is resolved, are limited. The present disclosure uses the concept of Talk-to-Resolve (TTR), which initiates a continuous dialogue with the user based on visual uncertainty analysis, asking a suitable question that conveys the nature of the problem to the user and seeking guidance until all ambiguities are resolved. The suitable question is formulated based on the scene understanding and the argument spans present in the natural language instruction. The present disclosure asks questions in a natural way that not only ensures the user can understand the type of confusion the robot is facing, but also ensures minimal and relevant questioning to resolve the ambiguities.
Type: Application
Filed: January 29, 2021
Publication date: May 12, 2022
Applicant: Tata Consultancy Services Limited
Inventors: Chayan SARKAR, Pradip Pramanick, Snehasis Banerjee, Brojeshwar Bhowmick
-
Patent number: 11295501
Abstract: Most prior art references that generate animations fail to determine and consider head movement data. The prior art references that do consider head movement data for generating animations rely on a sample video to determine the head movements and, as a result, fail to capture the changing head motions throughout the course of a speech given by a subject in an actual full-length video. The disclosure herein generally relates to generating facial animations and, more particularly, to a method and system for generating facial animations from a speech signal of a subject. The system determines the head movement, lip movements, and eyeball movements of the subject by processing a speech signal collected as input, and uses these movements to generate an animation.
Type: Grant
Filed: March 1, 2021
Date of Patent: April 5, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Sandika Biswas, Dipanjan Das, Sanjana Sinha, Brojeshwar Bhowmick
-
Patent number: 11288769
Abstract: The present disclosure provides a system and a method for stitching images using non-linear optimization and multi-constraint cost function minimization. Most conventional homography-based transformation approaches for image alignment calculate transformations using linear algorithms that ignore parameters such as lens distortion and are unable to handle parallax for non-planar images, resulting in improper image stitching with misalignments. The disclosed system and method generate an initial stitched image by estimating a global homography for each image, using an estimated pairwise homography matrix and feature point correspondences for each pair of images, based on a non-linear optimization. Local-warping-based image alignment is applied to the initial stitched image, using multi-constraint cost function minimization, to mitigate aberrations caused by noise in the global homography estimation and generate the refined stitched image.
Type: Grant
Filed: March 26, 2020
Date of Patent: March 29, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Arindam Saha, Soumyadip Maity, Brojeshwar Bhowmick
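The geometric core of homography-based stitching is the transfer (reprojection) error over feature correspondences, which is what a non-linear optimizer drives down. The sketch below shows that single cost term with a toy translation homography; the patent's multi-constraint cost adds further terms beyond this.

```python
import numpy as np

def apply_homography(H, pts):
    """Map Nx2 points through a 3x3 homography (projective transform)."""
    homo = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homo @ H.T
    return mapped[:, :2] / mapped[:, 2:3]  # divide out the projective scale

def transfer_error(H, src, dst):
    """Mean reprojection error ||H(src) - dst|| over correspondences: the
    kind of geometric cost a non-linear optimizer would minimize."""
    diff = apply_homography(H, src) - dst
    return float(np.mean(np.linalg.norm(diff, axis=1)))

# A pure-translation homography that shifts points by (2, 1).
H = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, 1.0],
              [0.0, 0.0, 1.0]])
src = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
dst = src + np.array([2.0, 1.0])
err = transfer_error(H, src, dst)
```

With the correct homography the error vanishes; a wrong one (e.g. identity) leaves a residual, which is the signal the optimizer follows.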
-
Publication number: 20220076431
Abstract: This disclosure relates generally to a system and method for forecasting the location of a target in monocular first-person view. Conventional systems for location forecasting utilize complex neural networks and hence are computationally intensive and require high compute power. The disclosed system includes an efficient and lightweight RNN-based network model for predicting the motion of targets in first-person monocular videos. The network model includes an auto-encoder in the encoding phase, and a regularizing layer at the end helps achieve better accuracy. The disclosed method relies entirely on detection bounding boxes, for both prediction and training of the network model, and is still capable of zero-shot transfer to a different dataset.
Type: Application
Filed: August 18, 2021
Publication date: March 10, 2022
Applicant: Tata Consultancy Services Limited
Inventors: Junaid Ahmed ANSARI, Brojeshwar Bhowmick
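To see what "prediction from detection bounding boxes alone" means, here is the standard non-learned baseline over (x, y, w, h) boxes: extrapolate at constant velocity. The disclosed model replaces this extrapolation with a lightweight RNN, but the input/output contract is the same; this baseline is illustrative, not the patented network.

```python
def forecast_bbox(history, n_future=3):
    """Constant-velocity baseline over bounding boxes (x, y, w, h):
    extrapolate the centre motion of the last two observed boxes,
    keeping the box size fixed."""
    (x0, y0, w0, h0), (x1, y1, w1, h1) = history[-2], history[-1]
    vx, vy = x1 - x0, y1 - y0  # per-frame displacement
    return [(x1 + k * vx, y1 + k * vy, w1, h1)
            for k in range(1, n_future + 1)]

preds = forecast_bbox([(0, 0, 10, 20), (2, 1, 10, 20)], n_future=2)
```

Because both the baseline and the learned model consume only box coordinates, no appearance features are needed, which is also what makes zero-shot transfer across datasets plausible.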
-
Publication number: 20220058850
Abstract: This disclosure relates generally to a method and system for generating 2D animated lip images synchronized to an audio signal for an unseen subject. Recent advances in Convolutional Neural Network (CNN) based approaches generate convincing talking heads. Personalization of such talking heads requires training the model with a large number of samples of the target person, which is time consuming. The lip generator system receives an audio signal and a target lip image of an unseen target subject as inputs from a user and processes these inputs to extract a plurality of high-dimensional audio-image features. The lip generator system is meta-trained with a training dataset that covers a large variety of subject ethnicities and vocabularies. The meta-trained model generates realistic animation for a previously unseen face and unseen audio when fine-tuned with only a few samples for a predefined interval of time. Additionally, the method protects intrinsic features of the unseen target subject.
Type: Application
Filed: August 18, 2021
Publication date: February 24, 2022
Applicant: Tata Consultancy Services Limited
Inventors: Swapna AGARWAL, Dipanjan DAS, Brojeshwar BHOWMICK
-
Patent number: 11256962
Abstract: Estimating 3D human pose from monocular images is a challenging problem due to the variety and complexity of human poses and the inherent ambiguity in recovering depth from a single view. Recent deep-learning-based methods show promising results by using supervised learning on 3D-pose-annotated datasets. However, the lack of large-scale 3D annotated training data makes 3D pose estimation difficult in the wild. Embodiments of the present disclosure provide a method that can effectively predict 3D human poses from only 2D poses in a weakly supervised manner, using both ground-truth 3D pose and ground-truth 2D pose, with re-projection error minimization as a constraint to predict the 3D joint locations. The method may further utilize additional geometric constraints on reconstructed body parts to regularize the pose in 3D, along with minimizing re-projection error, to improve the estimation of an accurate 3D pose.
Type: Grant
Filed: March 11, 2020
Date of Patent: February 22, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Sandika Biswas, Sanjana Sinha, Kavya Gupta, Brojeshwar Bhowmick
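The re-projection constraint at the heart of this weakly supervised setup can be written in a few lines: project the predicted 3D joints back to the image plane and penalize their distance to the 2D annotations. The sketch below uses a weak-perspective camera for simplicity (an assumption; the patent does not fix a camera model here) and omits the additional body-part geometric priors the abstract mentions.

```python
import numpy as np

def project(joints_3d, scale=1.0):
    """Weak-perspective projection of (J,3) joints onto the image plane:
    drop depth and apply a uniform scale."""
    return scale * joints_3d[:, :2]

def reprojection_loss(joints_3d, joints_2d, scale=1.0):
    """Mean squared distance between projected 3D joints and ground-truth
    2D joints: the constraint minimized during weakly supervised training."""
    diff = project(joints_3d, scale) - joints_2d
    return float(np.mean(np.sum(diff ** 2, axis=1)))

# A 3D pose whose projection matches its 2D annotation has zero loss.
pose_3d = np.array([[0.0, 0.0, 2.0],
                    [1.0, 1.0, 2.0]])
pose_2d = pose_3d[:, :2]
loss = reprojection_loss(pose_3d, pose_2d)
```

Because many 3D poses share the same projection, this loss alone is ambiguous, which is exactly why the method adds geometric regularizers on reconstructed body parts.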
-
AUDIO-SPEECH DRIVEN ANIMATED TALKING FACE GENERATION USING A CASCADED GENERATIVE ADVERSARIAL NETWORK
Publication number: 20220036617
Abstract: Conventional state-of-the-art methods are limited in their ability to generate realistic animation from audio for unknown faces and cannot be easily generalized to different facial characteristics and voice accents. Further, these methods fail to produce realistic facial animation for subjects whose facial characteristics differ significantly from the distribution the network has seen during training. Embodiments of the present disclosure provide systems and methods that generate an audio-speech-driven animated talking face using a cascaded generative adversarial network (CGAN), wherein a first GAN is used to transfer lip motion from a canonical face to a person-specific face. A second GAN-based texture generator network is conditioned on person-specific landmarks to generate a high-fidelity face corresponding to the motion. The texture generator GAN is made more flexible using meta-learning to adapt to an unknown subject's traits and face orientation during inference.
Type: Application
Filed: March 11, 2021
Publication date: February 3, 2022
Applicant: Tata Consultancy Services Limited
Inventors: Sandika BISWAS, Dipanjan DAS, Sanjana SINHA, Brojeshwar BHOWMICK
-
Systems and methods for coupled representation using transform learning for solving inverse problems
Patent number: 11216692
Abstract: This disclosure relates to systems and methods for solving generic inverse problems by providing a coupled representation architecture using transform learning. Conventional solutions are complex, require long training and testing times, and their reconstruction quality may not be suitable for all applications. Furthermore, these inherent lacunae preclude application to real-time scenarios. The methods provided herein involve very low computational complexity, with a need for only three matrix-vector products, and require very short training and testing times, which makes them applicable for real-time applications. Unlike conventional learning architectures using inductive approaches, the CASC of the present disclosure can learn directly from the source domain, and the number of features in a source domain need not be equal to the number of features in a target domain.
Type: Grant
Filed: July 3, 2019
Date of Patent: January 4, 2022
Assignee: Tata Consultancy Services Limited
Inventors: Kavya Gupta, Brojeshwar Bhowmick, Angshul Majumdar
-
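The "only three matrix-vector products" claim can be pictured as a pipeline of three learned linear operators: a source-domain analysis transform, a coupling map between the two coefficient spaces, and a target-domain synthesis operator. The matrices below are random placeholders standing in for learned transforms, and the three-operator factorization is an assumed reading of the abstract, not the patented architecture; note the source (6) and target (3) feature counts deliberately differ, echoing the abstract's last point.

```python
import numpy as np

rng = np.random.default_rng(0)
T_src = rng.standard_normal((4, 6))      # analysis transform, source domain
M = rng.standard_normal((5, 4))          # coupling map between coefficient spaces
T_tgt_syn = rng.standard_normal((3, 5))  # synthesis operator, target domain

def solve_inverse(x):
    """Test-time inference as exactly three matrix-vector products,
    mirroring the low computational cost the disclosure highlights."""
    z_src = T_src @ x         # 1) source-domain coefficients
    z_tgt = M @ z_src         # 2) coupled target-domain coefficients
    return T_tgt_syn @ z_tgt  # 3) target-domain reconstruction

y = solve_inverse(np.ones(6))
```

All the expensive work happens once, at training time; at test time the cost is fixed and tiny, which is what enables real-time use.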
Publication number: 20210370516
Abstract: The disclosure generally relates to methods and systems for enabling human-robot interaction by cognition sharing, which includes gesture and audio. Conventional techniques that use gestures and speech require an extra hardware setup and are limited to navigation in structured outdoor driving environments. The present disclosure herein provides methods and systems that solve the technical problem of enabling human-robot interaction with a two-step approach, by transferring the cognitive load from the human to the robot. In the first step, an accurate shared perspective associated with the task is determined by computing relative frame transformations based on an understanding of the navigational gestures of the subject. The shared perspective is then transformed into the robot's field of view. In the second step, the transformed shared perspective is given to a language grounding technique to accurately determine the final goal associated with the task.
Type: Application
Filed: February 4, 2021
Publication date: December 2, 2021
Applicant: Tata Consultancy Services Limited
Inventors: Soumyadip MAITY, Gourav KUMAR, Ruddra Dev ROY CHOUDHURY, Brojeshwar BHOWMICK
-
Publication number: 20210366173
Abstract: Speech-driven facial animation is useful for a variety of applications such as telepresence, chatbots, etc. The necessary attributes of a realistic face animation are: (1) audio-visual synchronization, (2) identity preservation of the target individual, (3) plausible mouth movements, and (4) presence of natural eye blinks. Existing methods mostly address audio-visual lip synchronization and the synthesis of natural facial gestures for overall video realism. However, existing approaches are not accurate. The present disclosure provides a system and method that learn the motion of facial landmarks as an intermediate step before generating texture. Person-independent facial landmarks are generated from audio for invariance to different voices, accents, etc. Eye blinks are imposed on the facial landmarks, and the person-independent landmarks are retargeted to person-specific landmarks to preserve identity-related facial structure.
Type: Application
Filed: September 29, 2020
Publication date: November 25, 2021
Applicant: Tata Consultancy Services Limited
Inventors: Sanjana SINHA, Sandika BISWAS, Brojeshwar BHOWMICK
-
Patent number: 11176724
Abstract: Speech-driven facial animation is useful for a variety of applications such as telepresence, chatbots, etc. The necessary attributes of a realistic face animation are: (1) audio-visual synchronization, (2) identity preservation of the target individual, (3) plausible mouth movements, and (4) presence of natural eye blinks. Existing methods mostly address audio-visual lip synchronization and the synthesis of natural facial gestures for overall video realism. However, existing approaches are not accurate. The present disclosure provides a system and method that learn the motion of facial landmarks as an intermediate step before generating texture. Person-independent facial landmarks are generated from audio for invariance to different voices, accents, etc. Eye blinks are imposed on the facial landmarks, and the person-independent landmarks are retargeted to person-specific landmarks to preserve identity-related facial structure.
Type: Grant
Filed: September 29, 2020
Date of Patent: November 16, 2021
Assignee: Tata Consultancy Services Limited
Inventors: Sanjana Sinha, Sandika Biswas, Brojeshwar Bhowmick
-
Publication number: 20210291363
Abstract: Conventional tele-presence robots have their own limitations with respect to task execution, information processing, and management. Embodiments of the present disclosure provide a tele-presence robot (TPR) that communicates with a master device associated with a user via an edge device for task execution, wherein a control command from the master device is parsed to determine the instruction set and task type for execution. Based on this determination, the TPR queries for information across storage devices until a response sufficient to execute the task is obtained. Upon execution, the task is validated with the master device and the user. Knowledge acquired during querying, task execution, and validation of the executed task is dynamically partitioned by the TPR across storage devices, namely the on-board memory of the tele-presence robot, an edge device, a cloud, and a web interface, depending upon the task type, the operating environment of the tele-presence robot, and other performance-affecting parameters.
Type: Application
Filed: September 9, 2020
Publication date: September 23, 2021
Applicant: Tata Consultancy Services Limited
Inventors: Chayan Sarkar, Snehasis Banerjee, Pradip Pramanick, Hrishav Bakul Barua, Soumyadip Maity, Dipanjan Das, Brojeshwar Bhowmick, Ashis Sau, Abhijan Bhattacharyya, Arpan Pal, Balamuralidhar PURUSHOTHAMAN, Ruddra Roy Chowdhury
-
Publication number: 20210208581
Abstract: Robotic platforms for tele-presence applications, such as remote meetings, group discussions, and the like, have gained paramount importance and attracted much attention. While some robotic platforms exist for such tele-presence applications, they lack efficacy in the communication and interaction between the remote person and the avatar robot deployed in another geographic location, thus adding network overhead. Embodiments of the present disclosure provide an edge-centric communication protocol for remotely maneuvering a tele-presence robot in a geographically distributed environment.
Type: Application
Filed: August 7, 2020
Publication date: July 8, 2021
Applicant: Tata Consultancy Services Limited
Inventors: Abhijan BHATTACHARYYA, Ashis SAU, Ruddra Dev ROYCHOUDHURY, Hrishav Bakul BARUA, Chayan SARKAR, Sayan PAUL, Brojeshwar BHOWMICK, Arpan PAL, Balamuralidhar PURUSHOTHAMAN
-
Patent number: 11033205
Abstract: A method and system are provided for finding and analyzing gait parameters and the postural balance of a person using a Kinect system. The system is easy to use and can be installed at home as well as in a clinic. The system includes a Kinect sensor, a software development kit (SDK), and a processor. The temporal skeleton information obtained from the Kinect sensor is used to evaluate gait parameters including stride length, stride time, stance time, and swing time. Eigenvector-based curvature detection is used to analyze the gait pattern at different speeds. In another embodiment, eigenvector-based curvature detection is employed to detect the static single limb stance (SLS) duration, along with gait variables, for evaluating body balance.
Type: Grant
Filed: February 8, 2017
Date of Patent: June 15, 2021
Assignee: TATA CONSULTANCY SERVICES LIMITED
Inventors: Kingshuk Chakravarty, Brojeshwar Bhowmick, Aniruddha Sinha, Abhijit Das
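Temporal gait parameters such as stride time fall out directly once foot events are located in the skeleton stream: a stride time is the interval between successive heel strikes of the same foot. The sketch below detects heel strikes as local minima of an ankle-height signal, a deliberately simple stand-in for the eigenvector-based curvature detection the patent describes; the signal values are synthetic.

```python
def heel_strikes(ankle_heights, times):
    """Detect heel strikes as strict local minima of the ankle-height
    signal from the skeleton stream (simplified event detector)."""
    return [times[i] for i in range(1, len(ankle_heights) - 1)
            if ankle_heights[i] < ankle_heights[i - 1]
            and ankle_heights[i] < ankle_heights[i + 1]]

def stride_times(strikes):
    """Stride time: interval between successive heel strikes of one foot."""
    return [b - a for a, b in zip(strikes, strikes[1:])]

# Synthetic ankle height over 0.8 s, sampled every 0.1 s: two heel strikes.
heights = [5, 3, 1, 3, 5, 3, 1, 3, 5]
t = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8]
strikes = heel_strikes(heights, t)
strides = stride_times(strikes)
```

Stance and swing times follow the same pattern, using toe-off events in addition to heel strikes to split each stride into its two phases.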