Patents by Inventor Michael Malakates

Michael Malakates has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11269934
    Abstract: Disclosed herein are techniques related to automated generation of regular expressions. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence.
    Type: Grant
    Filed: June 11, 2019
    Date of Patent: March 8, 2022
    Assignee: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Patent number: 11263247
    Abstract: Disclosed herein are techniques related to automated generation of regular expressions. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence.
    Type: Grant
    Filed: June 11, 2019
    Date of Patent: March 1, 2022
    Assignee: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Patent number: 11120086
    Abstract: Techniques are disclosed for toponym disambiguation. Toponym disambiguation can be performed for a set of geographic location data, such as placenames. A subset of the data and additional location information associated with the subset of the data can be initially determined. The remaining geographic location data in the set of geographic location data can be scored in order to determined additional location information for the remaining geographic location data. Additional location information for the remaining geographic location data can be determined based on calculated scores.
    Type: Grant
    Filed: February 11, 2019
    Date of Patent: September 14, 2021
    Assignee: Oracle International Corporation
    Inventors: Luis E. Rivas, Michael Malak, Mark L. Kreider
  • Publication number: 20210056264
    Abstract: Techniques are provided for identifying attributes associated with a neologism or an unknown word or name. Real world characteristics can be predicted for the neologism. Trigrams are identified for an input word and word embedding model vector values are calculated for the identified trigrams and entered into a matrix. Trigrams are identified for nearest names. Classification values are calculated based on the trigrams for the input word and the trigrams from the nearest names and the classification values are entered into the matrix. A convolutional neural network can process the matrix to identify one or more characteristics associated with the neologism.
    Type: Application
    Filed: August 4, 2020
    Publication date: February 25, 2021
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Michael Malak, Luis E. Rivas, Mark Lee Kreider
  • Patent number: 10885056
    Abstract: Techniques are disclosed for standardization of data. According to a first technique, standard representation terms are determined for to-be-standardized data using the to-be-standardized data itself and without using any external reference data. According to a second technique, a combination of the to-be-standardized data and an external reference is used to determine standard representation terms for the to-be-standardized data.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: January 5, 2021
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider, Philip Ogren, Robert James Oberbreckling
  • Publication number: 20200394478
    Abstract: Techniques are provided for performing sentiment analysis on words in a first data set. An example embodiment includes generating a word embedding model including a first plurality of features. A value indicating sentiment for the words in the first data set can be determined using a convolutional neural network (CNN). A second plurality of features are generated based on bigrams identified in the data set. The bigrams can be generated using a co-occurrence graph. The model is updated to include the second plurality of features, and sentiment analysis can be performed on a second data set using the updated model.
    Type: Application
    Filed: August 28, 2020
    Publication date: December 17, 2020
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Mark L. Kreider
  • Patent number: 10810472
    Abstract: Techniques are provided for performing sentiment analysis on words in a first data set. An example embodiment includes generating a word embedding model including a first plurality of features. A value indicating sentiment for the words in the first data set can be determined using a convolutional neural network (CNN). A second plurality of features are generated based on bigrams identified in the data set. The bigrams can be generated using a co-occurrence graph. The model is updated to include the second plurality of features, and sentiment analysis can be performed on a second data set using the updated model.
    Type: Grant
    Filed: May 10, 2018
    Date of Patent: October 20, 2020
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Michael Malak, Mark L. Kreider
  • Publication number: 20200320092
    Abstract: Techniques for generated regular expressions are disclosed. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence. A negative example may be used to generate the regular expression. Context from the negative example may be determined in order to generate the regular expression.
    Type: Application
    Filed: June 17, 2020
    Publication date: October 8, 2020
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Publication number: 20200320068
    Abstract: Techniques for generated regular expressions are disclosed. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence. Generation of the regular expressions can be implemented on an interactive user interface. Commands can be applied to the one or more character sequences and regular expressions are generated based on the applied commands.
    Type: Application
    Filed: June 17, 2020
    Publication date: October 8, 2020
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Publication number: 20200320142
    Abstract: Techniques for generated regular expressions are disclosed. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence. Alignment of span data structures may be performed when generating the regular expression.
    Type: Application
    Filed: June 17, 2020
    Publication date: October 8, 2020
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Publication number: 20200242111
    Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.
    Type: Application
    Filed: March 23, 2020
    Publication date: July 30, 2020
    Applicant: Oracle International Corporation
    Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray
  • Patent number: 10650000
    Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.
    Type: Grant
    Filed: September 14, 2017
    Date of Patent: May 12, 2020
    Assignee: Oracle International Corporation
    Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray
  • Publication number: 20190384763
    Abstract: Disclosed herein are techniques related to automated generation of regular expressions. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence.
    Type: Application
    Filed: June 11, 2019
    Publication date: December 19, 2019
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Publication number: 20190384772
    Abstract: Disclosed herein are techniques related to automated generation of regular expressions. In some embodiments, a regular expression generator may receive input data comprising one or more character sequences. The regular expression generator may convert character sequences into a sets of regular expression codes and/or span data structures. The regular expression generator may identify a longest common subsequence shared by the sets of regular expression codes and/or spans, and may generate a regular expression based upon the longest common subsequence.
    Type: Application
    Filed: June 11, 2019
    Publication date: December 19, 2019
    Applicant: Oracle International Corporation
    Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider
  • Patent number: D924175
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: July 6, 2021
    Assignee: HOMEWERKS WORLDWIDE, LLC
    Inventors: Peter D. Berkman, Michael Malaker
  • Patent number: D932611
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: October 5, 2021
    Assignee: Homewerks Worldwide, LLC
    Inventors: Peter D. Berkman, Michael Malaker
  • Patent number: D932612
    Type: Grant
    Filed: November 26, 2019
    Date of Patent: October 5, 2021
    Assignee: Homewerks Worldwide, LLC
    Inventors: Michael Malaker, Huang Xiaoying, Qui Xiwen, Li Xiaoshan, He Liting
  • Patent number: D933194
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: October 12, 2021
    Assignee: Homewerks Worldwide, LLC
    Inventors: Peter D. Berkman, Michael Malaker
  • Patent number: D933195
    Type: Grant
    Filed: November 26, 2019
    Date of Patent: October 12, 2021
    Assignee: Homewerks Worldwide, LLC
    Inventors: Michael Malaker, Huang Xiaoying, Jeffrey Jon Pischke, Zhang Ruilin
  • Patent number: D933809
    Type: Grant
    Filed: November 26, 2019
    Date of Patent: October 19, 2021
    Assignee: Homewerks Worldwide, LLC
    Inventors: Michael Malaker, Huang Xiaoying, Qui Xiwen, Li Xiaoshan, He Liting