Patents by Inventor Stefan Krawczyk
Stefan Krawczyk has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240078386
Abstract: Methods, apparatuses, and systems are presented for generating natural language models using a novel system architecture for feature extraction. A method for extracting features for natural language processing comprises: accessing one or more tokens generated from a document to be processed; receiving one or more feature types defined by user; receiving selection of one or more feature types from a plurality of system-defined and user-defined feature types, wherein each feature type comprises one or more rules for generating features; receiving one or more parameters for the selected feature types, wherein the one or more rules for generating features are defined at least in part by the parameters; generating features associated with the document to be processed based on the selected feature types and the received parameters; and outputting the generated features in a format common among all feature types.
Type: Application
Filed: November 2, 2023
Publication date: March 7, 2024
Applicant: 100.co Global Holdings, LLC
Inventors: Robert J. Munro, Schuyler D. Erle, Tyler J. Schnoebelen, Brendan D. Callahan, Jessica D. Long, Gary C. King, Paul A. Tepper, Jason A. Brenier, Stefan Krawczyk
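The feature-extraction steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the registry, the function names (`register_feature_type`, `extract_features`), the two example rules, and the `(type, value, weight)` output tuple are all assumptions chosen for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical common output format shared by every feature type:
# (feature_type_name, feature_value, weight)
Feature = Tuple[str, str, float]
FeatureRule = Callable[[List[str], dict], List[Feature]]


def ngram_rule(tokens: List[str], params: dict) -> List[Feature]:
    """System-defined feature type: contiguous n-grams, n taken from params."""
    n = params.get("n", 1)
    return [("ngram", " ".join(tokens[i:i + n]), 1.0)
            for i in range(len(tokens) - n + 1)]


def suffix_rule(tokens: List[str], params: dict) -> List[Feature]:
    """User-defined feature type: the last `length` characters of each token."""
    k = params.get("length", 3)
    return [("suffix", t[-k:], 1.0) for t in tokens if len(t) >= k]


# Registry holding system-defined feature types (an assumed design).
REGISTRY: Dict[str, FeatureRule] = {"ngram": ngram_rule}


def register_feature_type(name: str, rule: FeatureRule) -> None:
    """Add a user-defined feature type alongside the system-defined ones."""
    REGISTRY[name] = rule


def extract_features(tokens: List[str], selected: Dict[str, dict]) -> List[Feature]:
    """Apply each selected feature type with its parameters to the tokens."""
    features: List[Feature] = []
    for name, params in selected.items():
        features.extend(REGISTRY[name](tokens, params))
    return features


register_feature_type("suffix", suffix_rule)
features = extract_features(
    ["natural", "language", "models"],
    {"ngram": {"n": 2}, "suffix": {"length": 3}},
)
```

Because every rule returns the same tuple shape, downstream code can consume the output without knowing which feature types were selected.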
-
Patent number: 11675977
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Grant
Filed: March 27, 2020
Date of Patent: June 13, 2023
Assignee: Daash Intelligence, Inc.
Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
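As a rough illustration of the tokenizing method described in this abstract, the Python sketch below combines a rule set with a statistical fallback. Everything in it is an assumption made for the example: the rules are modeled as a plain set of valid token strings, the statistical model is a relative word frequency table, and the function names and the promotion threshold are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List, Set


def build_model(pool: List[str]) -> Dict[str, float]:
    """Relative frequency of each whitespace-delimited sequence in the pool."""
    counts = Counter(w for doc in pool for w in doc.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def promote(model: Dict[str, float], rules: Set[str], threshold: float) -> Set[str]:
    """Turn high-likelihood model entries into new rules."""
    return rules | {seq for seq, p in model.items() if p >= threshold}


def segmentations(text: str, rules: Set[str]) -> List[List[str]]:
    """All ways to split text into sequences that the rules accept as tokens."""
    if not text:
        return [[]]
    out = []
    for i in range(1, len(text) + 1):
        head = text[:i]
        if head in rules:
            out.extend([head] + rest for rest in segmentations(text[i:], rules))
    return out


def tokenize(text: str, rules: Set[str], model: Dict[str, float]) -> List[str]:
    tokens: List[str] = []
    for chunk in text.lower().split():
        candidates = segmentations(chunk, rules)
        if len(candidates) == 1:
            # The rules give exactly one reading, so use it directly.
            tokens.extend(candidates[0])
        elif candidates:
            # The rules are ambiguous: pick the split the model scores highest.
            best = max(candidates,
                       key=lambda c: sum(math.log(model.get(t, 1e-9)) for t in c))
            tokens.extend(best)
        else:
            # No rule applies: keep the chunk as a single token.
            tokens.append(chunk)
    return tokens


pool = ["new york city", "new york", "york city", "the city"]
model = build_model(pool)
rules = promote(model, {"newyork"}, threshold=0.2)
tokens = tokenize("NewYork city", rules, model)
```

Here `newyork` can be read either as one token or as `new` plus `york`, so the frequency model breaks the tie in favor of the split seen in the pool.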
-
Patent number: 11288444
Abstract: Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: selecting from a pool of documents, a first set of documents to be annotated; receiving annotations of the first set of documents elicited by first human readable prompts; training a natural language model using the annotated first set of documents; determining documents in the pool having uncertain natural language processing results according to the trained natural language model and/or the received annotations; selecting from the pool of documents, a second set of documents to be annotated comprising documents having uncertain natural language processing results; receiving annotations of the second set of documents elicited by second human readable prompts; and retraining a natural language model using the annotated second set of documents.
Type: Grant
Filed: December 11, 2020
Date of Patent: March 29, 2022
Assignee: 100.co, LLC
Inventors: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj
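The annotate, train, select, and retrain loop in this abstract resembles what is commonly called uncertainty sampling. The Python sketch below shows one round of that idea under loud assumptions: the word-count classifier, the function names, and the hard-coded annotations standing in for human answers are all hypothetical, and retraining on the combined first and second sets is a design choice of this sketch rather than something the abstract specifies.

```python
from collections import Counter
from typing import List, Tuple

Model = Tuple[Counter, Counter]


def train(annotated: List[Tuple[str, bool]]) -> Model:
    """Count how often each word appears in positive and negative documents."""
    pos, neg = Counter(), Counter()
    for text, label in annotated:
        (pos if label else neg).update(text.lower().split())
    return pos, neg


def predict(model: Model, text: str) -> float:
    """Smoothed share of positive evidence among the words of the document."""
    pos, neg = model
    words = text.lower().split()
    p = sum(pos[w] for w in words) + 1
    n = sum(neg[w] for w in words) + 1
    return p / (p + n)


def most_uncertain(model: Model, pool: List[str], k: int) -> List[str]:
    """Documents whose predicted probability lies closest to 0.5."""
    return sorted(pool, key=lambda d: abs(predict(model, d) - 0.5))[:k]


# First set, with annotations that stand in for answers from human annotators.
first = [("great product", True), ("terrible service", False)]
model = train(first)

pool = ["great service", "great great product", "terrible terrible service", "okay"]
second_docs = most_uncertain(model, pool, k=2)

# Second set of (assumed) annotations, then retraining on both sets.
second = [("great service", False), ("okay", True)]
model = train(first + second)
```

The two documents selected for the second round are the ones the first model could not place on either side, which is where a new annotation changes the model most.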
-
Publication number: 20210232761
Abstract: Systems and methods are presented for providing improved machine performance in natural language processing. In some example embodiments, an API module is presented that is configured to drive processing of a system architecture for natural language processing. Aspects of the present disclosure allow for a natural language model to classify documents while other documents are being retrieved in real time. The natural language model and the documents are configured to be stored in a stateless format, which also allows for additional functions to be performed on the documents while the natural language model is used to continue classifying other documents.
Type: Application
Filed: December 22, 2020
Publication date: July 29, 2021
Inventors: Schuyler D. Erle, Robert J. Munro, Brendan D. Callahan, Jason Brenier, Paul A. Tepper, Jessica D. Long, James B. Robinson, Aneesh Nair, Michelle Casbon, Stefan Krawczyk
-
Publication number: 20210232760
Abstract: Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: selecting from a pool of documents, a first set of documents to be annotated; receiving annotations of the first set of documents elicited by first human readable prompts; training a natural language model using the annotated first set of documents; determining documents in the pool having uncertain natural language processing results according to the trained natural language model and/or the received annotations; selecting from the pool of documents, a second set of documents to be annotated comprising documents having uncertain natural language processing results; receiving annotations of the second set of documents elicited by second human readable prompts; and retraining a natural language model using the annotated second set of documents.
Type: Application
Filed: December 11, 2020
Publication date: July 29, 2021
Inventors: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj
-
Publication number: 20210157984
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Application
Filed: March 27, 2020
Publication date: May 27, 2021
Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
-
Publication number: 20210081611
Abstract: Methods, apparatuses, and systems are presented for generating natural language models using a novel system architecture for feature extraction. A method for extracting features for natural language processing comprises: accessing one or more tokens generated from a document to be processed; receiving one or more feature types defined by user; receiving selection of one or more feature types from a plurality of system-defined and user-defined feature types, wherein each feature type comprises one or more rules for generating features; receiving one or more parameters for the selected feature types, wherein the one or more rules for generating features are defined at least in part by the parameters; generating features associated with the document to be processed based on the selected feature types and the received parameters; and outputting the generated features in a format common among all feature types.
Type: Application
Filed: April 29, 2020
Publication date: March 18, 2021
Applicant: Singapore Biotech PTE. LTD.
Inventors: Robert J. Munro, Schuyler D. Erle, Tyler J. Schnoebelen, Brendan D. Callahan, Jessica D. Long, Gary C. King, Paul A. Tepper, Jason A. Brenier, Stefan Krawczyk
-
Publication number: 20200234002
Abstract: Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: selecting from a pool of documents, a first set of documents to be annotated; receiving annotations of the first set of documents elicited by first human readable prompts; training a natural language model using the annotated first set of documents; determining documents in the pool having uncertain natural language processing results according to the trained natural language model and/or the received annotations; selecting from the pool of documents, a second set of documents to be annotated comprising documents having uncertain natural language processing results; receiving annotations of the second set of documents elicited by second human readable prompts; and retraining a natural language model using the annotated second set of documents.
Type: Application
Filed: November 21, 2018
Publication date: July 23, 2020
Inventors: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj
-
Publication number: 20190377788
Abstract: Methods, apparatuses, and systems are presented for generating natural language models using a novel system architecture for feature extraction. A method for extracting features for natural language processing comprises: accessing one or more tokens generated from a document to be processed; receiving one or more feature types defined by user; receiving selection of one or more feature types from a plurality of system-defined and user-defined feature types, wherein each feature type comprises one or more rules for generating features; receiving one or more parameters for the selected feature types, wherein the one or more rules for generating features are defined at least in part by the parameters; generating features associated with the document to be processed based on the selected feature types and the received parameters; and outputting the generated features in a format common among all feature types.
Type: Application
Filed: January 2, 2019
Publication date: December 12, 2019
Applicant: AIPARC HOLDINGS PTE. LTD.
Inventors: Robert J. Munro, Schuyler D. Erle, Tyler J. Schnoebelen, Brendan D. Callahan, Jessica D. Long, Gary C. King, Paul A. Tepper, Jason A. Brenier, Stefan Krawczyk
-
Publication number: 20190243886
Abstract: Systems and methods are presented for providing improved machine performance in natural language processing. In some example embodiments, an API module is presented that is configured to drive processing of a system architecture for natural language processing. Aspects of the present disclosure allow for a natural language model to classify documents while other documents are being retrieved in real time. The natural language model and the documents are configured to be stored in a stateless format, which also allows for additional functions to be performed on the documents while the natural language model is used to continue classifying other documents.
Type: Application
Filed: September 7, 2018
Publication date: August 8, 2019
Applicant: Idibon, Inc.
Inventors: Schuyler D. Erle, Robert J. Munro, Brendan D. Callahan, Jason Brenier, Paul A. Tepper, Jessica D. Long, James B. Robinson, Aneesh Nair, Michelle Casbon, Stefan Krawczyk
-
Publication number: 20190205377
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Application
Filed: August 6, 2018
Publication date: July 4, 2019
Applicant: Idibon, Inc.
Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
-
Publication number: 20180157636
Abstract: Methods, apparatuses, and systems are presented for generating natural language models using a novel system architecture for feature extraction. A method for extracting features for natural language processing comprises: accessing one or more tokens generated from a document to be processed; receiving one or more feature types defined by user; receiving selection of one or more feature types from a plurality of system-defined and user-defined feature types, wherein each feature type comprises one or more rules for generating features; receiving one or more parameters for the selected feature types, wherein the one or more rules for generating features are defined at least in part by the parameters; generating features associated with the document to be processed based on the selected feature types and the received parameters; and outputting the generated features in a format common among all feature types.
Type: Application
Filed: November 15, 2017
Publication date: June 7, 2018
Applicant: Idibon, Inc.
Inventors: Robert J. Munro, Schuyler D. Erle, Tyler J. Schnoebelen, Brendan D. Callahan, Jessica D. Long, Gary C. King, Paul A. Tepper, Jason A. Brenier, Stefan Krawczyk
-
Patent number: 9965458
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Grant
Filed: December 9, 2015
Date of Patent: May 8, 2018
Assignee: Sansa AI Inc.
Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
-
Publication number: 20180095946
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Application
Filed: May 16, 2017
Publication date: April 5, 2018
Applicant: Idibon, Inc.
Inventors: Robert Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
-
Publication number: 20160162466
Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
Type: Application
Filed: December 9, 2015
Publication date: June 9, 2016
Applicant: Idibon, Inc.
Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk
-
Publication number: 20160162569
Abstract: Systems and methods are presented for providing improved machine performance in natural language processing. In some example embodiments, an API module is presented that is configured to drive processing of a system architecture for natural language processing. Aspects of the present disclosure allow for a natural language model to classify documents while other documents are being retrieved in real time. The natural language model and the documents are configured to be stored in a stateless format, which also allows for additional functions to be performed on the documents while the natural language model is used to continue classifying other documents.
Type: Application
Filed: December 9, 2015
Publication date: June 9, 2016
Applicant: Idibon, Inc.
Inventors: Schuyler D. Erle, Robert J. Munro, Brendan D. Callahan, Jason Brenier, Paul A. Tepper, Jessica D. Long, James B. Robinson, Aneesh Nair, Michelle Casbon, Stefan Krawczyk
-
Publication number: 20160162467
Abstract: Methods, apparatuses, and systems are presented for generating natural language models using a novel system architecture for feature extraction. A method for extracting features for natural language processing comprises: accessing one or more tokens generated from a document to be processed; receiving one or more feature types defined by user; receiving selection of one or more feature types from a plurality of system-defined and user-defined feature types, wherein each feature type comprises one or more rules for generating features; receiving one or more parameters for the selected feature types, wherein the one or more rules for generating features are defined at least in part by the parameters; generating features associated with the document to be processed based on the selected feature types and the received parameters; and outputting the generated features in a format common among all feature types.
Type: Application
Filed: December 9, 2015
Publication date: June 9, 2016
Applicant: Idibon, Inc.
Inventors: Robert J. Munro, Schuyler D. Erle, Tyler J. Schnoebelen, Brendan D. Callahan, Jessica D. Long, Gary C. King, Paul A. Tepper, Jason Brenier, Stefan Krawczyk
-
Publication number: 20160162457
Abstract: Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: selecting from a pool of documents, a first set of documents to be annotated; receiving annotations of the first set of documents elicited by first human readable prompts; training a natural language model using the annotated first set of documents; determining documents in the pool having uncertain natural language processing results according to the trained natural language model and/or the received annotations; selecting from the pool of documents, a second set of documents to be annotated comprising documents having uncertain natural language processing results; receiving annotations of the second set of documents elicited by second human readable prompts; and retraining a natural language model using the annotated second set of documents.
Type: Application
Filed: December 9, 2015
Publication date: June 9, 2016
Applicant: Idibon, Inc.
Inventors: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj
-
Patent number: 8676583
Abstract: An action is performed in a spoken dialog system in response to a user's spoken utterance. A policy which maps belief states of user intent to actions is retrieved or created. A belief state is determined based on the spoken utterance, and an action is selected based on the determined belief state and the policy. The action is performed, and in one embodiment, involves requesting clarification of the spoken utterance from the user. Creating a policy may involve simulating user inputs and spoken dialog system interactions, and modifying policy parameters iteratively until a policy threshold is satisfied. In one embodiment, a belief state is determined by converting the spoken utterance into text, assigning the text to one or more dialog slots associated with nodes in a probabilistic ontology tree (POT), and determining a joint probability based on probability distribution tables in the POT and on the dialog slot assignments.
Type: Grant
Filed: August 30, 2011
Date of Patent: March 18, 2014
Assignee: Honda Motor Co., Ltd.
Inventors: Rakesh Gupta, Deepak Ramachandran, Antoine Raux, Neville Mehta, Stefan Krawczyk, Matthew Hoffman
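The idea of mapping a belief state over user intents to an action, including a clarification request, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the intents, the likelihood table, the single-step Bayesian update, and the fixed confidence threshold are hypothetical, and the sketch does not model the probabilistic ontology tree or the simulated policy training described in the abstract.

```python
from typing import Dict, Tuple

Belief = Dict[str, float]


def update_belief(prior: Belief,
                  likelihood: Dict[str, Dict[str, float]],
                  observation: str) -> Belief:
    """Bayesian update of the intent distribution given one observed word."""
    unnorm = {intent: prior[intent] * likelihood[intent].get(observation, 1e-6)
              for intent in prior}
    z = sum(unnorm.values())
    return {intent: v / z for intent, v in unnorm.items()}


def policy(belief: Belief, threshold: float = 0.7) -> Tuple[str, str]:
    """Act on the top intent if it is likely enough, otherwise ask to clarify."""
    intent, p = max(belief.items(), key=lambda kv: kv[1])
    return ("execute", intent) if p >= threshold else ("clarify", intent)


# Hypothetical intents and word likelihoods for an in-car assistant.
prior = {"find_restaurant": 0.5, "find_gas": 0.5}
likelihood = {
    "find_restaurant": {"food": 0.8, "near": 0.5},
    "find_gas": {"food": 0.1, "near": 0.5},
}

ambiguous = policy(update_belief(prior, likelihood, "near"))
confident = policy(update_belief(prior, likelihood, "food"))
```

The word "near" is equally likely under both intents, so the belief stays flat and the policy asks for clarification; "food" shifts most of the probability mass to one intent, so the policy acts on it.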
-
Publication number: 20120053945
Abstract: An action is performed in a spoken dialog system in response to a user's spoken utterance. A policy which maps belief states of user intent to actions is retrieved or created. A belief state is determined based on the spoken utterance, and an action is selected based on the determined belief state and the policy. The action is performed, and in one embodiment, involves requesting clarification of the spoken utterance from the user. Creating a policy may involve simulating user inputs and spoken dialog system interactions, and modifying policy parameters iteratively until a policy threshold is satisfied. In one embodiment, a belief state is determined by converting the spoken utterance into text, assigning the text to one or more dialog slots associated with nodes in a probabilistic ontology tree (POT), and determining a joint probability based on probability distribution tables in the POT and on the dialog slot assignments.
Type: Application
Filed: August 30, 2011
Publication date: March 1, 2012
Applicant: HONDA MOTOR CO., LTD.
Inventors: Rakesh Gupta, Deepak Ramachandran, Antoine Raux, Neville Mehta, Stefan Krawczyk, Matthew Hoffman