Patents by Inventor Nikolaos Koudas

Nikolaos Koudas has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20050027717
    Abstract: An organization's data records are often noisy because of transcription errors, incomplete information, and lack of standard formats for textual data. A fundamental task during data cleansing and integration is matching strings, perhaps across multiple relations, that refer to the same entity (e.g., organization name or address). Furthermore, it is desirable to perform this matching within an RDBMS, which is where the data is likely to reside. In this paper, we adapt the widely used and established cosine similarity metric from the information retrieval field to the relational database context in order to identify potential string matches across relations. We then use this similarity metric to characterize this key aspect of data cleansing and integration as a join between relations on textual attributes, where the similarity of matches exceeds a specified threshold. Computing an exact answer to the text join can be expensive.
    Type: Application
    Filed: April 21, 2004
    Publication date: February 3, 2005
    Inventors: Nikolaos Koudas, Divesh Srivastava, Luis Gravano, Panagiotis Ipeirotis
  • Patent number: 6738762
    Abstract: An approach for multidimensional substring selectivity estimation utilizes set hashing to generate cross-counts as needed, instead of storing cross-counts for the most frequently co-occurring substrings. Set hashing is a Monte Carlo technique that is used to succinctly represent the set of tuples containing a given substring. Then, any combination of set hashes will yield a cross-count when intersected. Thus, the set hashing technique is useful in three-, four- and other multidimensional situations, since only an intersection function is required.
    Type: Grant
    Filed: November 26, 2001
    Date of Patent: May 18, 2004
    Assignee: AT&T Corp.
    Inventors: Zhiyuan Chen, Philip Russell Korn, Nikolaos Koudas, Shanmugavelayutham Muthukrishnan
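
As an illustration of the text join described in the abstract of publication 20050027717, the following is a minimal sketch of a cosine-similarity join over two lists of strings. It is not the method claimed in the application: it runs in plain Python rather than inside an RDBMS, it computes the exact answer by brute force, and the whitespace tokenization, the tf-idf weighting formula, and all function names (tokenize, tfidf_vectors, text_join) are assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(s):
    # Assumption: lowercase, whitespace-separated words serve as tokens.
    return s.lower().split()


def tfidf_vectors(strings):
    """Return one unit-length tf-idf vector (a dict) per input string."""
    docs = [Counter(tokenize(s)) for s in strings]
    n = len(docs)
    # Document frequency: number of strings that contain each token.
    df = Counter()
    for d in docs:
        df.update(d.keys())
    vectors = []
    for d in docs:
        # Assumed weighting: tf * log(1 + n / df); rare tokens weigh more.
        w = {t: tf * math.log(1 + n / df[t]) for t, tf in d.items()}
        norm = math.sqrt(sum(x * x for x in w.values()))
        vectors.append({t: x / norm for t, x in w.items()} if norm else {})
    return vectors


def text_join(left, right, threshold):
    """Return (left string, right string, similarity) for every pair
    whose cosine similarity exceeds the threshold."""
    # Assumption: token weights are computed over both relations combined.
    vecs = tfidf_vectors(left + right)
    lv, rv = vecs[:len(left)], vecs[len(left):]
    matches = []
    for i, a in enumerate(lv):
        for j, b in enumerate(rv):
            # Vectors are unit length, so the dot product is the cosine.
            sim = sum(w * b.get(t, 0.0) for t, w in a.items())
            if sim > threshold:
                matches.append((left[i], right[j], sim))
    return matches
```

The nested loop compares every pair of strings, which illustrates why the abstract notes that computing an exact answer to the text join can be expensive.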
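
As an illustration of the set hashing idea in the abstract of patent 6738762, the following is a minimal sketch that summarizes the set of tuple identifiers containing a substring as a short signature and estimates a two-way cross-count from two such signatures. It is not the patented procedure, which the abstract does not spell out. It uses min-wise hashing, a standard form of set hashing, and the salted BLAKE2 hash, the known set sizes, and all function names (hash_value, signature, estimate_resemblance, estimate_cross_count) are assumptions made for this example.

```python
import hashlib


def hash_value(i, x):
    """The i-th hash of tuple id x (assumed construction: salted BLAKE2b)."""
    digest = hashlib.blake2b(
        str(x).encode(), digest_size=8, salt=i.to_bytes(8, "little")
    ).digest()
    return int.from_bytes(digest, "big")


def signature(tuple_ids, k):
    """Summarize a non-empty set of tuple ids by its k minimum hash values."""
    return [min(hash_value(i, x) for x in tuple_ids) for i in range(k)]


def estimate_resemblance(sig_a, sig_b):
    """Estimate |A intersect B| / |A union B| as the fraction of
    signature components on which the two signatures agree."""
    agree = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return agree / len(sig_a)


def estimate_cross_count(size_a, size_b, sig_a, sig_b):
    """Estimate |A intersect B|, assuming both set sizes are known.

    With resemblance r = |I| / (|A| + |B| - |I|), solving for |I|
    gives |I| = r * (|A| + |B|) / (1 + r).
    """
    r = estimate_resemblance(sig_a, sig_b)
    return r * (size_a + size_b) / (1 + r)
```

Each substring's signature has a fixed length k regardless of how many tuples contain it, and a cross-count is estimated only when a pair of signatures is actually compared, in line with the abstract's point that cross-counts are generated as needed rather than stored.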