Patents by Inventor Thomas Kollar
Thomas Kollar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240171724Abstract: The present disclosure provides neural fields for sparse novel view synthesis of outdoor scenes. Given just a single or a few input images from a novel scene, the disclosed technology can render new 360° views of complex unbounded outdoor scenes. This can be achieved by constructing an image-conditional triplanar representation to model the 3D surrounding from various perspectives. The disclosed technology can generalize across novel scenes and viewpoints for complex 360° outdoor scenes.Type: ApplicationFiled: October 16, 2023Publication date: May 23, 2024Applicants: TOYOTA RESEARCH INSTITUTE, INC., TOYOTA JIDOSHA KABUSHIKI KAISHAInventors: MUHAMMAD ZUBAIR IRSHAD, SERGEY ZAKHAROV, KATHERINE Y. LIU, VITOR GUIZILINI, THOMAS KOLLAR, ADRIEN D. GAIDON, RARES A. AMBRUS
-
Publication number: 20230401721Abstract: A method for 3D object perception is described. The method includes extracting features from each image of a synthetic stereo pair of images. The method also includes generating a low-resolution disparity image based on the features extracted from each image of the synthetic stereo pair images. The method further includes predicting, by a trained neural network, a feature map based on the low-resolution disparity image and one of the synthetic stereo pair of images. The method also includes generating, by a perception prediction head, a perception prediction of a detected 3D object based on the feature map predicted by the trained neural network.Type: ApplicationFiled: June 13, 2022Publication date: December 14, 2023Applicants: TOYOTA RESEARCH INSTITUTE, INC., TOYOTA JIDOSHA KABUSHIKI KAISHAInventors: Thomas KOLLAR, Kevin STONE, Michael LASKEY, Mark Edward TJERSLAND
-
Publication number: 20230398696Abstract: A robotic system is contemplated. The robotic system comprises a robot comprising a camera, a microphone, memory, and a controller that is configured to receive a natural language command for performing an action within a real world environment, parse the natural language command, categorize the action as being associated with guidance for performing the action, receive the guidance for performing the action, the guidance including a motion applied to at least one portion of the robot within the real world environment for performing the action, and store, in the memory, the natural language command in correlation with the motion that is applied to the at least one portion of the robot.Type: ApplicationFiled: June 14, 2022Publication date: December 14, 2023Applicants: Toyota Research Institute, Inc., Toyota Jidosha Kabushiki KaishaInventor: Thomas Kollar
-
Publication number: 20230398692Abstract: A method for training a neural network to perform 3D object manipulation is described. The method includes extracting features from each image of a synthetic stereo pair of images. The method also includes generating a low-resolution disparity image based on the features extracted from each image of the synthetic stereo pair of images. The method further includes generating, by the neural network, a feature map based on the low-resolution disparity image and one of the synthetic stereo pair of images. The method also includes manipulating an unknown object perceived from the feature map according to a perception prediction from a prediction head.Type: ApplicationFiled: June 13, 2022Publication date: December 14, 2023Applicants: TOYOTA RESEARCH INSTITUTE, INC., TOYOTA JIDOSHA KABUSHIKI KAISHAInventors: Thomas KOLLAR, Kevin STONE, Michael LASKEY, Mark Edward TJERSLAND
-
Publication number: 20230077856Abstract: System, methods, and other embodiments described herein relate to single-shot multi-object three-dimensional (3D) shape reconstruction and categorical six-dimensional (6D) pose and size estimation. In one embodiment, a method includes inferring a heatmap based upon a feature pyramid, where the feature pyramid is generated based upon a red green blue depth (RGB-D) image that includes objects. The method further includes sampling a 3D parameter map at locations corresponding to peaks in the heatmap, where the 3D parameter map is inferred based upon the feature pyramid, and where the locations include latent shape codes, 6D poses, and one-dimensional (1D) scales. The method further includes generating point clouds based upon the latent shape codes, the 6D poses, and the 1D scales.Type: ApplicationFiled: August 25, 2022Publication date: March 16, 2023Applicants: Toyota Research Institute, Inc., Toyota Jidosha Kabushiki KaishaInventors: Muhammad Zubair Irshad, Thomas Kollar, Michael Laskey, Kevin Stone
-
Publication number: 20230032575Abstract: A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.Type: ApplicationFiled: August 8, 2022Publication date: February 2, 2023Inventors: Cengiz Erbas, Thomas Kollar, Avnish Sikka, Spyridon Matsoukas, Simon Peter Reavely
-
Patent number: 11410646Abstract: A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.Type: GrantFiled: March 28, 2019Date of Patent: August 9, 2022Assignee: Amazon Technologies, Inc.Inventors: Cengiz Erbas, Thomas Kollar, Avnish Sikka, Spyridon Matsoukas, Simon Peter Reavely
-
Patent number: 10304444Abstract: A system capable of performing natural language understanding (NLU) without the concept of a domain that influences NLU results. The present system uses a hierarchical organizations of intents/commands and entity types, and trained models associated with those hierarchies, so that commands and entity types may be determined for incoming text queries without necessarily determining a domain for the incoming text. The system thus operates in a domain agnostic manner, in a departure from multi-domain architecture NLU processing where a system determines NLU results for multiple domains simultaneously and then ranks them to determine which to select as the result.Type: GrantFiled: June 29, 2016Date of Patent: May 28, 2019Assignee: Amazon Technologies, Inc.Inventors: Lambert Mathias, Thomas Kollar, Arindam Mandal, Angeliki Metallinou
-
Publication number: 20170278514Abstract: A system capable of performing natural language understanding (NLU) without the concept of a domain that influences NLU results. The present system uses a hierarchical organizations of intents/commands and entity types, and trained models associated with those hierarchies, so that commands and entity types may be determined for incoming text queries without necessarily determining a domain for the incoming text. The system thus operates in a domain agnostic manner, in a departure from multi-domain architecture NLU processing where a system determines NLU results for multiple domains simultaneously and then ranks them to determine which to select as the result.Type: ApplicationFiled: June 29, 2016Publication date: September 28, 2017Inventors: Lambert Mathias, Thomas Kollar, Arindam Mandal, Angeliki Metallinou
-
Publication number: 20100247485Abstract: Described herein are formulations and devices for delivering compounds to arthropods and microorganisms within the arthropods. The formulations are generally composed of a sugar and the compound, wherein the compound targets a particular pathogen or other microorganism within the arthropod, kills the arthropod, or a combination thereof.Type: ApplicationFiled: September 5, 2008Publication date: September 30, 2010Applicant: MEVLABS, INC.Inventor: Thomas Kollars
-
Publication number: 20060044111Abstract: A system for tracking and reporting data using RFID technology includes an article and a radio frequency identification tag attached to the article. The tag has an identifier associating the tag with the article and containing data representative of information about the article. A reader senses the presence of the identification tag and reads the identifier information and the data. An operations computer is in communication with the reader for receiving from the reader, in real time, the identifier information and the data, recording the identifier information and the data, and generating output data regarding the article. At least one workstation remote from the operations computer an in communication with the operations computer is able to access the output data generated by the operations computer.Type: ApplicationFiled: February 24, 2003Publication date: March 2, 2006Applicant: JAFA Technologies., Inc.,Inventors: Thomas Kollar, Carol Bozarth, Edward Mulka
-
Publication number: 20050238713Abstract: An insect and/or arthropod trapping device that generates its own attractants of carbon dioxide (CO2), and ammonia through the chemical reaction of adding a weakly acidic liquid such as vinegar (acetic acid) to solids such as baking soda (sodium bicarbonate), with the optional addition of urea and/or lactic acid. The liquids are mixed over a period of days onto the solids to generate CO2 in the vicinity of an insect/arthropod trap having glue boards that trap the insects and arthropods when they alight on the glue board. The attractants can be used with devices that utilize various combinations of other insect attractants and traps such as sound, light, scent, visual, electrical, chemical, sticky surfaces, mesh nets, etc., to further attract and trap or kill insects and/or arthropods.Type: ApplicationFiled: June 29, 2005Publication date: October 27, 2005Applicant: Ticks or Mosquitoes, L.L.C.Inventors: Thomas Kollars, Edwin Masters, Jacqueline Masters, Peggy Kollars