Patents by Inventor Dhiraj Joshi
Dhiraj Joshi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12249041Abstract: Described are techniques for oblique image rectification. The techniques include receiving an original image depicting an oblique view of a circular object and pre-processing the original image into an edge image. The techniques further include generating, by a machine learning model based on the edge image, a heatmap including an ellipse formed by the oblique view of the circular object. The techniques further include computing ellipse parameters describing the ellipse of the heatmap. The techniques further include performing, using the ellipse parameters, an affine transformation on the original image to generate a rectified image, where the rectified image converts the ellipse to a circle.Type: GrantFiled: October 24, 2022Date of Patent: March 11, 2025Assignee: International Business Machines CorporationInventors: Sebastien Gilbert, Michele Merler, Dhiraj Joshi, Apurv Gupta, Shyama Prosad Chowdhury, Chidansh Amitkumar Bhatt, Nirmit V. Desai
-
Patent number: 12186907Abstract: Dynamically adjusting, using artificial intelligence (AI), sensors and models of an autonomous roaming robotic device, which includes receiving data regarding an asset at a computer of a roaming robotic device from sensors on the robotic device. The robotic device identifies an asset at a location using the sensors, and the robotic device has instructions, received from a control system, to inspect the location or items at the location. The data is analyzed using the computer of the robotic device, and the analysis includes using historical data for the asset. An AI model is loaded using the computer of the robotic device, based on the identification of the asset. A sensor is selected using the computer of the robotic device, for conducting an inspection of the asset based on the analysis of the data and the AI model.Type: GrantFiled: October 26, 2021Date of Patent: January 7, 2025Assignee: International Business Machines CorporationInventors: Jenny S. Li, Raghu Ramaswamy, Nirmit V Desai, Dhiraj Joshi, Satish Rajani, Nancy Anne Greco, Shiva G, Aakash Praliya, Wei-Han Lee, Luis Angel Bathen, Tova Roth, Sujoy Kumar Roy Chowdhury, Prakriti Pritmani, Kay Murphy, Shilpa Shenai, Arun Yashwant Ingale, Ajjay Ratnakar, Gwilym Benjamin Lee Newton
-
Publication number: 20240233067Abstract: Described are techniques for oblique image rectification. The techniques include receiving an original image depicting an oblique view of a circular object and pre-processing the original image into an edge image. The techniques further include generating, by a machine learning model based on the edge image, a heatmap including an ellipse formed by the oblique view of the circular object. The techniques further include computing ellipse parameters describing the ellipse of the heatmap. The techniques further include performing, using the ellipse parameters, an affine transformation on the original image to generate a rectified image, where the rectified image converts the ellipse to a circle.Type: ApplicationFiled: October 24, 2022Publication date: July 11, 2024Inventors: Sebastien Gilbert, Michele Merler, Dhiraj Joshi, Apurv Gupta, Shyama Prosad Chowdhury, CHIDANSH AMITKUMAR BHATT, Nirmit V. Desai
-
Publication number: 20240135486Abstract: Described are techniques for oblique image rectification. The techniques include receiving an original image depicting an oblique view of a circular object and pre-processing the original image into an edge image. The techniques further include generating, by a machine learning model based on the edge image, a heatmap including an ellipse formed by the oblique view of the circular object. The techniques further include computing ellipse parameters describing the ellipse of the heatmap. The techniques further include performing, using the ellipse parameters, an affine transformation on the original image to generate a rectified image, where the rectified image converts the ellipse to a circle.Type: ApplicationFiled: October 23, 2022Publication date: April 25, 2024Inventors: Sebastien Gilbert, Michele Merler, Dhiraj Joshi, Apurv Gupta, Shyama Prosad Chowdhury, CHIDANSH AMITKUMAR BHATT, Nirmit V. Desai
-
Publication number: 20240112444Abstract: Automated analog gauge reading is provided. The method comprises a computer system receiving input of an image and detecting at least one analog gauge in the image. The computer system corrects the orientation of the analog gauge in the image and detects scene text and tick labels on the analog gauge. The computer system determines a position of a pointer on the analog gauge relative to the scene text and outputs a gauge reading value based on an arithmetic progression of tick labels and angle of the pointer with respect to minimum and maximum values on the analog gauge.Type: ApplicationFiled: September 29, 2022Publication date: April 4, 2024Inventors: Michele Merler, Dhiraj Joshi, Apurv Gupta, Sebastien Gilbert, Shyama Prosad Chowdhury, Chidansh Amitkumar Bhatt, Nirmit V. Desai
-
Publication number: 20240104369Abstract: A system may receive an existing base set of knowledge, train a neural network on the base set of knowledge, deploy the neural network on a new data set, generate, using the deployment, instances of new knowledge, and validate the instances of new knowledge.Type: ApplicationFiled: September 26, 2022Publication date: March 28, 2024Inventors: Dinesh C. Verma, Franck Vinh Le, Michele Merler, Dhiraj Joshi, SUPRIYO CHAKRABORTY, Seraphin Bernard Calo
-
Patent number: 11830241Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.Type: GrantFiled: January 25, 2020Date of Patent: November 28, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen C. Hammer, John Joseph Kent, John R. Smith, Rogerio Feris
-
Publication number: 20230186121Abstract: A method, computer program product, and system include a processor(s) that engages, based on a request for an inference, from a group of sensors of multiple modalities at a physical location, sensor(s) of a main modality to provide data to a pipeline to generate the inference. The pipeline includes one or more machine learning models which generate the inference for a downstream task. The processor(s) obtains raw data from the sensor(s) of the main modality and applies an outlier detector to the raw data. Based on determining that there is an outlier the processor(s) automatically engages sensor(s) of at least one different modality than the main modality from the group of sensors of multiple modalities and obtains new raw data from the sensor(s) of the at least one different modality. The processor(s) applies the one or more machine learning models to the new raw data to derive the inference.Type: ApplicationFiled: December 10, 2021Publication date: June 15, 2023Inventors: Jenny S. Li, Nirmit V. Desai, Dhiraj Joshi, Raghu Ramaswamy, Satish Rajani
-
Publication number: 20230126457Abstract: Dynamically adjusting, using artificial intelligence (AI), sensors and models of an autonomous roaming robotic device, which includes receiving data regarding an asset at a computer of a roaming robotic device from sensors on the robotic device. The robotic device identifies an asset at a location using the sensors, and the robotic device has instructions, received from a control system, to inspect the location or items at the location. The data is analyzed using the computer of the robotic device, and the analysis includes using historical data for the asset. An AI model is loaded using the computer of the robotic device, based on the identification of the asset. A sensor is selected using the computer of the robotic device, for conducting an inspection of the asset based on the analysis of the data and the AI model.Type: ApplicationFiled: October 26, 2021Publication date: April 27, 2023Inventors: Jenny S. Li, Raghu Ramaswamy, Nirmit V Desai, Dhiraj Joshi, Satish Rajani, Nancy Anne Greco, Shiva G, Aakash Praliya, Wei-Han Lee, Luis Angel Bathen, Tova Roth, Sujoy Kumar Roy Chowdhury, Prakriti Pritmani, Kay Murphy, Shilpa Shenai, Arun Yashwant Ingale, Ajjay Ratnakar, Gwilym Benjamin Lee Newton
-
Publication number: 20230124038Abstract: Optimizing sensing capabilities of a roaming robotic device using Artificial Intelligence (AI) includes receiving data at a control system having a computer from a robotic device. The control system communicating a policy to the robotic device for choosing navigation actions for the robotic device. The received data is analyzed using the control system for determining when the received data meets a threshold for determining quality of the data. The analysis can include generating a model based on the received data where the model includes vector representation of inputs detected by a sensor array at the location. In response to the received data at the control system not meeting the threshold for determining quality, the robotic device communicating with the control system to collaborate in updating the policy to choose a next action.Type: ApplicationFiled: October 18, 2021Publication date: April 20, 2023Inventors: Jenny S. Li, Nirmit V. Desai, Dhiraj Joshi, Raghu Ramaswamy, Satish Rajani
-
Patent number: 11521044Abstract: Techniques regarding action detection based on motion in receptive fields of a neural network model are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a motion component that can extract a motion vector from a plurality of adaptive receptive fields in a deformable convolution layer of a neural network model. The computer executable components can also comprise an action detection component that can generate a spatio-temporal feature by concatenating the motion vector with a spatial feature extracted from the deformable convolution layer.Type: GrantFiled: May 17, 2018Date of Patent: December 6, 2022Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, THE BOARD OF TRUSTEES OF THE UNIVERSITY OF ILLINOISInventors: Khoi-Nguyen C. Mac, Raymond Alexander Yeh, Dhiraj Joshi, Minh N. Do, Rogerio Feris, Jinjun Xiong
-
Patent number: 10740802Abstract: Systems and methods to analyze a person's social media photos or videos, such as those posted on Twitter, Facebook, Instagram, etc. and determine properties of their social life. Using information on the number of people appearing in the photos or videos, their ages, and genders, this method can predict whether the person is in a romantic relationship, has a close family, is a group person, or is single. This information is valuable for generating audiovisual content recommendations as well as for advertisers, because it allows targeting personalized advertisements to the person posting the photos. The described methods may be performed (and the advertisements or other content may be selected for recommendation) substantially in real-time as the user accesses a specific online resource.Type: GrantFiled: August 18, 2014Date of Patent: August 11, 2020Assignee: FUJI XEROX CO., LTD.Inventors: Dhiraj Joshi, Lynn Wilcox, Francine Chen
-
Publication number: 20200162799Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.Type: ApplicationFiled: January 25, 2020Publication date: May 21, 2020Inventors: Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen C. Hammer, John Joseph Kent, John R. Smith, Rogerio Feris
-
Patent number: 10595101Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.Type: GrantFiled: March 15, 2018Date of Patent: March 17, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen C. Hammer, John Joseph Kent, John R. Smith, Rogerio Feris
-
Patent number: 10489447Abstract: A method of generating a caption for a social media post is provided. The method may include receiving a social media post to be posted to a social media network; collecting reference data relevant to determining common activities occurring at a category of location associated with the social media post; inferring potential topics for captions from a topic inference model, based on the collected reference data associated with the social media post; setting parameters associated with a language model based on the inferred topic; and generating at least one caption for the social media post based on the identified language model, and the inferred topic.Type: GrantFiled: December 17, 2015Date of Patent: November 26, 2019Assignee: FUJI XEROX CO., LTD.Inventors: Yin-Ying Chen, Francine Chen, Matthew L. Cooper, Dhiraj Joshi
-
Publication number: 20190354835Abstract: Techniques regarding action detection based on motion in receptive fields of a neural network model are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a motion component that can extract a motion vector from a plurality of adaptive receptive fields in a deformable convolution layer of a neural network model. The computer executable components can also comprise an action detection component that can generate a spatio-temporal feature by concatenating the motion vector with a spatial feature extracted from the deformable convolution layer.Type: ApplicationFiled: May 17, 2018Publication date: November 21, 2019Inventors: Khoi-Nguyen C. Mac, Raymond Alexander Yeh, Dhiraj Joshi, Minh N. Do, Rogerio Feris, Jinjun Xiong
-
Publication number: 20190289372Abstract: A method and system for auto-curating a media are provided. Media content is received over the network interface. A set of markers is identified for the media content, each marker corresponding to one of a plurality of visible and audible cues in the media content. Segments in the media content are identified based on the identified set of markers. An excitement score is computed for each segment based on the identified markers that fall within the segment. A highlight clip is generated by identifying segments having excitement scores greater than a threshold.Type: ApplicationFiled: March 15, 2018Publication date: September 19, 2019Inventors: Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen C. Hammer, John Joseph Kent, John R. Smith, Rogerio Feris
-
Patent number: 10318884Abstract: A method associates social media messages with venues. A social network graph includes users, messages from users, and venues. The venues include multiple primary venues and a no-venue. A link between a message and the no-venue node indicates that the message is not associated with a primary venue. Training feature vectors are constructed that measure connectedness between messages and venues. The process trains a classifier to estimate probabilities that messages are associated with venues. A new social media message is received, and the process constructs a feature vector using the same features as the training vectors, measuring connectedness between the new message and the no-venue. The classifier computes a probability that the new message is associated with the no-venue. When the probability exceeds a predefined threshold, the new message is not associated with any of the primary venues. Otherwise, the new message is associated with one of the primary venues.Type: GrantFiled: August 25, 2015Date of Patent: June 11, 2019Assignee: FUJI XEROX CO., LTD.Inventors: Francine Chen, Bokai Cao, Yin-Ying Chen, Dhiraj Joshi
-
Patent number: 10198635Abstract: Systems and methods disclosed herein associate images with business venues. An example method includes: receiving a first image and retrieving textual reviews and stored images that are associated with one or more candidate business venues. The method further includes: detecting, using trained visual detectors, a plurality of business-aware concepts in the first image and assessing likelihood that detected business-aware concepts are in the first image. The method additionally includes: (i) generating a first representation of the first image based on the likelihoods and one or more term vectors for high-scoring concepts and (ii) receiving second representations of each candidate based on the retrieved textual reviews and stored images. In accordance with determining that the first representation is most similar to a respective second representation of a first candidate, the method includes: (i) associating the first image with the first candidate and (ii) providing an indication of the association.Type: GrantFiled: January 19, 2016Date of Patent: February 5, 2019Assignee: FUJI XEROX CO., LTD.Inventors: Bor-Chun Chen, Yin-Ying Chen, Francine R. Chen, Dhiraj Joshi
-
Patent number: 10089392Abstract: A method for automatically selecting thematically representative music is disclosed. A processor is used for using a theme-related keyword to search a keyword-indexed video repository to retrieve videos associated with the theme-related keyword; analyzing the retrieved videos to select videos with music; and extracting music tracks and features from the selected videos. The method further includes selecting representative music related to the theme from the extracted music tracks using the extracted features; and storing the selected representative music in a processor accessible memory.Type: GrantFiled: July 28, 2015Date of Patent: October 2, 2018Assignee: KODAK ALARIS INC.Inventors: Jiebo Luo, Dhiraj Joshi, Charles Parker