From Unstructured Or Semi-structured Data To Structured Data Patents (Class 707/811)
  • Patent number: 7873680
    Abstract: A method for providing processed data definition documents (DDDs) or processed document object models (DOMs) for object oriented programming. The use of these processed data definitions simplifies the data structures and streamlines programming to access the data. A standard DDD/DOM has a hierarchical branched structure having a number of levels each with elements/nodes and attributes. The DDD is written in a platform independent markup language. An element/node is selected and its attributes are identified. All ‘children’ of the selected element/node are identified. The attributes of the selected element/node (parent) are then copied to each child for all children in the DDD/DOM. This is repeated for all elements/nodes in the DDD/DOM to result in a processed DDD/processed DOM which is now structured to allow program access to data in a more direct manner.
    Type: Grant
    Filed: June 13, 2008
    Date of Patent: January 18, 2011
    Assignee: International Business Machines Corporation
    Inventor: Chad L. Meadows
  • Patent number: 7865491
    Abstract: The present invention provides systems and articles of manufacture that enhance the capability of a database abstraction model and query application constructed for an underlying physical database. Typically, the query application is used to compose and execute an abstract query. Once an initial query result is presented to a user, a user may select to execute a model entity operation by interacting with a query interface of the query application. A model entity operation allows the user to retrieve additional information from the underlying database, based on information included in the initial query result, without having to create a new query or having to correlate the results of multiple queries.
    Type: Grant
    Filed: October 12, 2009
    Date of Patent: January 4, 2011
    Assignee: International Business Machines Corporation
    Inventors: Richard D. Dettinger, Daniel P. Kolz
  • Patent number: 7860872
    Abstract: A web-based media analysis system, consisting of automated media analysis and document management tools, which processes news articles by parsing the news contents or documents and assigning, relating, and extracting information from the news contents for media analysis and relationally storing them in at least one database. The system further comprises a toning engine for toning articles accurately, based on words, attributes and categories of the article, and optionally based on the author of the article, if applicable.
    Type: Grant
    Filed: January 29, 2007
    Date of Patent: December 28, 2010
    Assignee: NIKIP Technology Ltd.
    Inventors: Brett Serjeantson, Paul Williamson
  • Publication number: 20100306260
    Abstract: Numbered sequences detection includes (i) extracting one or more numbered item token patterns from a document comprising an ordered sequence of text units, each numbered item token pattern including an incremental portion and a fixed portion that matches at least one text unit of the document and (ii) identifying at least one numbered sequence in the document conforming with a matching numbered item token pattern of the extracted one or more numbered item token patterns. The identified at least one numbered sequence comprises an ordered sub-sequence of text units of the document that match the matching numbered item token pattern. The detection may further comprise determining that a second type of numbered sequence nests in the document between consecutive text units belonging to a numbered sequence of a first type, and optimizing one or more numbered sequences of the second type based on information provided by the determining.
    Type: Application
    Filed: May 29, 2009
    Publication date: December 2, 2010
    Applicant: Xerox Corporation
    Inventor: Herve Dejean
  • Publication number: 20100299334
    Abstract: A computer implemented system and method for providing a computer and collaboration platform around knowledge transfer, expertise, innovation, tangible, intangible and information assets are disclosed. The system converts a static expert content into an active forum in an effective manner to promote collaboration among users in the various categories. The system converts the static content into one or more sections according to a parsing rule. Each section is assigned with one or more categories and one or more plug-ins for forming a framework. The system captures the framework as a model and generates one or more data partnering objects, and stores them in a database. Pursuant to a user's request and attributes of the user, appropriate one or more of the data partnering objects are retrieved from the database to generate an active forum. The system, then, publishes the active forum.
    Type: Application
    Filed: September 8, 2009
    Publication date: November 25, 2010
    Inventors: Greg Waite, Jason Clark
  • Patent number: 7836104
    Abstract: According to some embodiments, demonstration data is received via a front-end application associated with a business information enterprise system. The demonstration data may then be interpreted in accordance with at least one rule to generate business data. A query may be received at a back-end application associated with the business information enterprise system. At least a portion of the business data may then be presented in accordance with the received query.
    Type: Grant
    Filed: June 3, 2005
    Date of Patent: November 16, 2010
    Assignee: SAP AG
    Inventors: Eric Schemer, Tanja B. Wingerter, Markus Ulke
  • Publication number: 20100281076
    Abstract: An assisting method and an assisting apparatus for accessing a markup language document are provided. First, an intermediate table is established in a storage unit, wherein the intermediate table includes a length field, a depth field, a type field, a parent element field, and an offset field. Then, structure data of each element in the markup language document is transformed into the intermediate table to respectively record a string length, a hierarchy depth, an element type, a parent element index, and an absolute position of the element into the length field, the depth field, the type field, the parent element field, and the offset field. Finally, access to the markup language document is assisted according to the intermediary table.
    Type: Application
    Filed: July 28, 2009
    Publication date: November 4, 2010
    Applicant: NATIONAL TAIWAN UNIVERSITY
    Inventors: Sheng-Wen Pan, Sheng-De Wang
  • Patent number: 7822788
    Abstract: A condition generating unit generates a hierarchical-type search condition including a search target structure ID and a search result structure ID. A first acquiring unit acquires an object ID corresponding to the search target structure ID to which a vocabulary index is not attached. A candidate generating unit generates a candidate of the search result in which an acquired object ID is associated with the search key as a first constraint condition. A second acquiring unit acquires a search result structure ID complying with a structure constraint. A result acquiring unit acquires an object corresponding to the object ID satisfying the first constraint condition.
    Type: Grant
    Filed: August 30, 2007
    Date of Patent: October 26, 2010
    Assignee: Kabushiki Kaisha Toshiba
    Inventor: Takuya Kanawa
  • Patent number: 7814124
    Abstract: An architecture and method is provided that facilitates serialization of a graph of objects into streams of data in an arbitrary format, and deserialization of the streams of data back into the graph of objects. The architecture provides a number of services associated with the basic functionality of serialization and deserialization. The services can be employed to implement transparent remoting, copy items to a clipboard and save data to a file. The present invention provides facilities which support the plugging in of a new serialization encoding by separating the encoding from the reading and reinstantiation of the graph of objects which the encoding describes. Objects in a graph of objects are serialized and deserialized based on a selected rule set for that object. A rule set can be provided by a class author within a class or within a third party file referred to as a surrogate.
    Type: Grant
    Filed: July 11, 2005
    Date of Patent: October 12, 2010
    Assignee: Microsoft Corporation
    Inventors: Stephen Peter de Jong, Gopala Krishna R. Kakivaya, Joseph L. Roxe
  • Patent number: 7809734
    Abstract: A system and method for transcoding digital content (e.g. web content) by correctly employing one annotation for multiple digital contents. This can efficiently reduce the workloads required for the addition of annotation data during the transcoding process. A transcoding system comprises an annotation database system for storing annotation data to be used for the transcoding of contents, and a transcoder for transcoding the contents based on annotation data stored in the annotation database system. Upon receiving an inquiry from the transcoder, a correlation between elements in the contents and descriptions of the annotation data is checked to select one annotation that can be employed for transcoding the content. The correlation is specifically determined based on XPath information.
    Type: Grant
    Filed: June 10, 2008
    Date of Patent: October 5, 2010
    Assignee: International Business Machines Corporation
    Inventors: Takashi Itoh, Hironobu Takagi
  • Patent number: 7788289
    Abstract: A system records information relating to performing a logical activity on a group of devices. The information includes information transmitted to each device in the group of devices and information received from each device in the group of devices. The system also uses the recorded information for troubleshooting purposes.
    Type: Grant
    Filed: December 12, 2005
    Date of Patent: August 31, 2010
    Assignee: Verizon Business Global LLC
    Inventors: Paul M. Golobay, John M. Hahs, Hieu V. Mai, Kelvin R. Russell, Parker C. Webb
  • Patent number: 7779051
    Abstract: A method for creating a data warehousing scheme having optimally selected components. A mathematical model of a goal for the data warehousing scheme is input into an optimization engine. At least one constraint on the data warehousing scheme is input into the optimization engine. A mathematical optimization algorithm is performed using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components. The optimized data warehousing scheme can be stored.
    Type: Grant
    Filed: January 2, 2008
    Date of Patent: August 17, 2010
    Assignee: International Business Machines Corporation
    Inventors: Robert R. Friedlander, James R. Kraemer
  • Patent number: 7779050
    Abstract: A method and system to process a domain. A domain is modeled with one or more domain models. Support models are generated from domain models. An ontological system utilizes the support models to interact with and manipulate the domain models. Further, the support models are used to describe domain model states.
    Type: Grant
    Filed: April 5, 2006
    Date of Patent: August 17, 2010
    Assignee: Microsoft Corporation
    Inventors: Dan Adamson, Leo Shih, Alain T. Rappaport
  • Patent number: 7779052
    Abstract: A system includes a relational database and processing logic. The relational database is configured to define a relationship between a group of logical activities and groups of physical commands that perform the logical activities. The processing logic is configured to receive a request to perform one logical activity of the group of logical activities, translate the one logical activity into one group of physical commands using the relational database, and cause the one logical activity to be performed on a remote device using the one group of physical commands.
    Type: Grant
    Filed: December 12, 2005
    Date of Patent: August 17, 2010
    Assignee: Verizon Business Global LLC
    Inventors: Paul M. Golobay, John M. Hahs, Hieu V. Mai, Kelvin R. Russell, Parker C. Webb
  • Publication number: 20100205208
    Abstract: A system and method are disclosed for locating, collecting, collating, analyzing, and reporting on summarized data that is generated from regulatory compliance matter data. Data is collected and compiled from multiple sources, including government databases, web pages, and regulatory documents. These collected data are consolidated and reconciled. A single topic record is created for a person, company, or product. Duplication and redundancy within the information are reduced. Corrections in data format are made for inconsistencies that exist between different information sources. Broad and/or narrow searches are enabled with the retrieval of information and/or relevant documents. The integrated data and associated analyses can be presented in reports that can be made accessible through a LAN, a WAN, a desktop, and/or a web interface.
    Type: Application
    Filed: April 22, 2010
    Publication date: August 12, 2010
    Applicant: GRAEMATTER, INC.
    Inventor: Melissa C. Walker
  • Patent number: 7774301
    Abstract: Provided are techniques for transforming unstructured information into content in a uniform context. The unstructured information and metadata associated with the unstructured information are extracted from one or more source content repositories. One or more custom transformations are performed on at least one of the unstructured information and the metadata. At least one of the transformed, unstructured information and the metadata are loaded into one or more target content repositories.
    Type: Grant
    Filed: December 21, 2006
    Date of Patent: August 10, 2010
    Assignee: International Business Machines Corporation
    Inventors: Sean Allen Johnson, Amisha Parikh, Angela Fagundes Reese, Ravishankar Sathyam, Clifford J. Vars, IV, Jedd Samuel Weise, Anderson Edward Wolfe
  • Patent number: 7774388
    Abstract: A method, system, and apparatus for identifying, describing, integrating, and discovering information events with the unique feature of applicability to and extensibility across event states irrespective of function and or embodiment in one system with one approach, one infrastructure, one architecture, one method, and one principled basis, comprising: a self-mint method for self-service identity; an information architecture; a method to organize everything; a scalable business process for integrating data from different tables and/or from different systems into one combined system; a programming process and language, operating system architecture, and modeling medium; and a search engine and directory to make it all accessible; altogether comprising an infrastructure for a network system.
    Type: Grant
    Filed: August 18, 2007
    Date of Patent: August 10, 2010
    Inventor: Margaret Runchey
  • Patent number: 7761484
    Abstract: Converting data to an appropriate format for use with a service. An example method is illustrated where a message including data expressed using dynamic language data expressions is received. The dynamic language data expressions include a tree structure organization for the data. The data expressed using dynamic language data expressions is expressed in an XML data structure. The XML data structure preserves the original tree structure organization for the data.
    Type: Grant
    Filed: February 9, 2007
    Date of Patent: July 20, 2010
    Assignee: Microsoft Corporation
    Inventors: Erik B. Christensen, Stephen J. Maine, Natasha Jethanandani, Krishnan Rangachari, Sowmyanarayanan K. Srinivasan, Eugene Osovetsky
  • Publication number: 20100174717
    Abstract: This invention concerns an iterative procedure for conversion of structured software objects into a raw data stream and vice versa, providing for their direct transfer using simple communication resources such as those of an embedded computer station, and reset of said software objects or reutilisation of memory space allocated to them. This procedure can be used by an embedded platform (2) or a portable object including at least a processor capable of exchanging information with a terminal in the form of linear data sequences. The procedure includes a step for conversion of a data set, in one direction or the other, between a linear data sequence arrangement on the one hand, and a structured arrangement describing or representing an object-oriented software object on the other hand.
    Type: Application
    Filed: February 26, 2003
    Publication date: July 8, 2010
    Inventors: Olivier Fambon, André Freyssinet, Serge Lacourte
  • Patent number: 7752238
    Abstract: The present invention provides a method for converting a searchable electronic catalog of the type used in e-commerce and industrial materiel systems. Such catalogs are typically configured as databases but can be created from a variety of different source materials. The method includes identifying a set of items to be converted, identifying the characteristics for each item, accessing the characteristic values for each identified item, accessing mapping rules for each characteristic and each item, mapping the characteristic values for each item in the first catalog into the characteristic identified by the rule for the item in the second catalog, and compiling the mapped characteristic values for each item to form the second catalog.
    Type: Grant
    Filed: April 10, 2003
    Date of Patent: July 6, 2010
    Assignee: Requisite Software Inc.
    Inventors: James Michael Wilmsen, Michael Renn Neal, Nathan Eric Wykes, Ian Straub
  • Patent number: 7734636
    Abstract: A system for classifying a genre of an electronic document may include a network processor configured to receive an electronic document and convert the electronic document to rich text format (RTF). The processor may be configured to parse the RTF document into lines of text ordered from top to bottom and left to right and assign tokens to each line of text based on content of the line and to line separators based on space between blocks of lines. The network processor may be configured to sequence the tokens, parse the tokenized document with a number of pre-defined document grammars, determine a probability for each genre corresponding to the electronic document, and classify the electronic document as the genre with the highest probability.
    Type: Grant
    Filed: March 31, 2005
    Date of Patent: June 8, 2010
    Assignee: Xerox Corporation
    Inventor: John C. Handley
  • Patent number: 7725503
    Abstract: A method of serializing and deserializing unknown data types in a strongly typed model. The method includes serializing an object to a data stream at first node and communicating the data stream to a second node. The second node may be another process, machine or a file on a disk. The data stream is deserialized at a later time, and the data types within the data stream are determined. Objects are instantiated in accordance with known data types, and unknown objects are created to retain information related to each unknown data type in the data stream. These unknown objects are used to regenerate the unknown data type when a serialization operation is performed at the second node on an unknown object.
    Type: Grant
    Filed: January 10, 2006
    Date of Patent: May 25, 2010
    Assignee: Microsoft Corporation
    Inventors: Caleb L. Doise, Gopalakrishna R. Kakivaya
  • Patent number: 7725504
    Abstract: The present invention provides a method and apparatus for helping a user form a structured diagram from an unstructured information source. Starting with one or more key information elements such as some special words, the requests of a customer contained in the information source can be obtained by performing interactive and iterative searching in the unstructured information source such as text, audio, video and etc., the artifacts representing them are drawn in the diagram, and linkages are established between the artifacts and the corresponding contents in the information source. The present invention also proposes that the distribution of established linkages can be used to check whether all the requests in the information source have been extracted in the diagram. Further, various levels of warnings can be shown according to the density of linkage distribution. Therefore the user can draw a structured diagram more conveniently and quickly, and can perform checking and reusing more easily.
    Type: Grant
    Filed: March 28, 2006
    Date of Patent: May 25, 2010
    Assignee: International Business Machines Corporation
    Inventors: Zhao Ming Qiu, Guo Tong Xie, Dong Liu, Gang Hu
  • Patent number: 7725499
    Abstract: A unified system for the structured collection, management, translation, and publication of multi-lingual information that is based on industry standards for information structures. Self-contained information units are stored in a single source and mapped onto a multiple dimensional data matrix in which the axes represent information types, objects, variants, and language. Linking through the matrix to a unique storage location for information facilities data entry, editing, access control, quality control, and automated publication of stored information.
    Type: Grant
    Filed: February 1, 2007
    Date of Patent: May 25, 2010
    Assignee: Star AG
    Inventors: Florian von Lepel, Stephan Finkler
  • Publication number: 20100114994
    Abstract: Communication from applications may be carried in XML-based events through sockets, Web services, JMS, HTTP, telnet channels, and the like to an OPC client. The OPC client may include an event engine configured to process the XML-based events, and convert them to appropriate COM/DCOM API invocations. In some embodiments, the OPC client buffers collected data from the COM/DCOM API, and transmits the buffered data in an XML event to an application based on a subscription time schedule and/or value condition. The OPC client allows service oriented event-driven applications to interact with industry devices remotely via the open architecture provided by the OPC specification using a business level language syntax.
    Type: Application
    Filed: October 8, 2008
    Publication date: May 6, 2010
    Applicant: Oracle International Corporation
    Inventors: Qiming Huang, Honghao Zhou
  • Publication number: 20100114995
    Abstract: A system that may be used to generate documents, where the system is accessible from and integratable with remote systems is provided. A document generation system that may be provided as a service, in the sense that the system may be accessed via, e.g., the Internet, from remote systems such as credit aggregators. The system takes in information regarding the requirements (e.g., type of transaction, number of parties, amount of loan, price, governing law, etc.) of the document needed (e.g., a loan application, construction contract, etc.) and produces an appropriate form based on the input information. It should be noted that access to the document generation system is effectuated by an Internet/network connection via, e.g., a partner provider system such as a DMS, a traditional credit aggregator, a credit aggregator portal, etc. Electronic signature and secure archiving may be provided for generated documents.
    Type: Application
    Filed: October 22, 2008
    Publication date: May 6, 2010
    Inventors: Kevin KOPP, Parameswaran RAMAKRISHNAN, Chris BORDEMAN
  • Patent number: 7707203
    Abstract: A computer system and method for capture, managing and presenting data obtained from various often unrelated postings via the Internet for examination by a user. This system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on the corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators and processing modules. The processing modules in the platform include scraping management module analyzing selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: April 27, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Joseph Ting
  • Patent number: 7702674
    Abstract: A computer system and method for capture, managing and presenting data obtained from various often unrelated postings via the Internet for examination by a user. This system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on the corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators and processing modules. The processing modules in the platform include scraping management module analyzing selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: April 20, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Joseph Ting
  • Patent number: 7693917
    Abstract: A method for managing a back-end information storage infrastructure and a flexible development environment for data storage using a computer system. The method includes managing system resources including a relational database. Meta data models are created to model processes and to define meta data elements and their relationships by using trees and graphs. The method manages access to the data by authenticating users through several levels of authentication describing user rights, while providing management of multi-user access and concurrency. The method includes running the processes that generate instance data, storing the instance data following the meta data model, and transforming the instance data into physical views.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: April 6, 2010
    Assignee: Intelligent Medical Objects, Inc.
    Inventors: Regis Charlot, Frank Naeymi-Rad, Alina Oganesova, Jose Maldonado, David Oran Haines
  • Publication number: 20100077011
    Abstract: A machine based tool and associated logic and methodology are used in converting data from an input form to a target form using context dependent conversion rules. In particular, a frame-slot architecture is utilized where a frame represents an intersection between a contextual cue recognized by the machine tool, associated content and related constraint information to specific to that conversion environment, whereas a slot represents an included chunk of information. An exemplary conversion system (400) includes a parser (402) for use in parsing and converting an input stream (403) from a source (404) to provide an output stream (411) in a form for use by a target system (412). To accomplish the desired conversion, the parser (402) uses information from a public schema (406), a private schema (408) and a grammar (410). The public schema (406), private schema (408) and grammar (410) may include conversion rules applicable to less than the whole of a subject matter area including the input stream (403).
    Type: Application
    Filed: May 19, 2009
    Publication date: March 25, 2010
    Inventors: Edward A. Green, Kevin L. Markey, Alee Sharp
  • Publication number: 20100070541
    Abstract: The present disclosure provides a student information state reporting system. The disclosed system allows a user to define a form that includes questions for capturing data elements related to a state report. The user also associates the form with a snapshot for automatic synchronization of at least one of the data elements. In addition, the user defines at least one field in the form to be included in the snapshot and automatically synchronized. Next, the user 118 associates the form with an output definition that matches a state report format. Preferably, an output based on the output definition for the state report is automatically scheduled, and the automatic synchronization is only applied when in an on-line mode.
    Type: Application
    Filed: September 3, 2009
    Publication date: March 18, 2010
    Applicant: METAPHOR SOFTWARE, INC.
    Inventor: Scott John Orr
  • Patent number: 7680867
    Abstract: A method and apparatus are disclosed for transforming information from one semantic environment to another. In one implementation, a SOLx system (1700) includes a Normalization/Translation NorTran Workbench (1702) and a SOLx server (1708). The NorTran Workbench (1702) is used to develop a knowledge base based on information from a source system (1712), to normalize legacy content (1710) according to various rules, and to develop a database (1706) of translated content. During run time, the SOLx server (1708) receives transmissions from the source system (1712), normalizes the transmitted content, accesses the database (1706) of translated content and otherwise translates the normalized content, and reconstructs the transmission to provide substantially real-time transformation of electronic messages.
    Type: Grant
    Filed: January 10, 2006
    Date of Patent: March 16, 2010
    Assignee: Silver Creek Systems, Inc.
    Inventors: Edward A. Green, Ramon Krosley, Kevin L. Markey
  • Patent number: 7680854
    Abstract: A computer system and method for capture and handling job listings obtained from various often unrelated corporate and job board postings via the internet for examination by a job searcher. This system includes a scraping module having one or more scraping engines operable to scrape job information data set from job listings on the corporate career sites and job boards, wherein the scraping module receives and stores the scraped job information data set in a database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: March 16, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Pal Takacsi-Nagy
  • Patent number: 7680855
    Abstract: A computer system and method for capture, managing and presenting data obtained from various often unrelated postings via the Internet for examination by a user. This system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on the corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators and processing modules. The processing modules in the platform include scraping management module analyzing selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: March 16, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Joseph Ting
  • Patent number: 7676523
    Abstract: A method and system are described for managing data quality. An example method may include obtaining a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group, each data item including data attribute values, each data quality item including data quality attribute values associated with one of the data items. The first aggregated data quality value, a first indicator associating the first aggregated data quality value with the first group, and the first group may be selected. The first group and the first indicator may be stored in a user table of a database. A data quality table associated with the user table may be determined based on an entry in a system table. The first aggregated data quality value and the first indicator may be stored in the data quality table.
    Type: Grant
    Filed: April 20, 2007
    Date of Patent: March 9, 2010
    Assignee: SAP AG
    Inventors: Anja Klein, Hong-Hai Do, Gregor Hackenbroich, Juergen Anke
  • Patent number: 7676522
    Abstract: A method and system are described for including data quality in data streams. An example method may include obtaining a first group of data items, each data item including one or more data attribute values. A first group of data quality items may be determined, each data quality item including one or more data quality attribute values associated with one of the data items of the first group. A first aggregated data quality value may be determined based on the first group of data quality items. A first data stream interval including the first group of data items and the first aggregated data quality value may be output.
    Type: Grant
    Filed: April 20, 2007
    Date of Patent: March 9, 2010
    Assignee: SAP AG
    Inventors: Anja Klein, Hong-Hai Do, Gregor Hackenbroich