Data Extraction, Transformation, And Loading (etl) Patents (Class 707/602)
  • Patent number: 8868370
    Abstract: The present invention is a sample analyzing system, including a sample analyzer and a management apparatus connected to the sample analyzer via a communication network. The management apparatus includes: a first memory that stores a computer program for the sample analyzer and manual data which corresponds to a version of the computer program; a first communication device; and a first controller configured to transmit, via the first communication device to the sample analyzer, the computer program and the manual data corresponding to the version of the computer program stored in the first memory. The sample analyzer includes: a second communication device; a second memory that stores the computer program and the manual data received by the second communication device; and a second controller configured to execute the computer program stored in the second memory.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: October 21, 2014
    Assignee: Sysmex Corporation
    Inventors: Naoki Shindo, Yusuke Suga, Aya Konishi, Daigo Fukuma, Keisuke Kuwano
  • Patent number: 8868485
    Abstract: Techniques are disclosed for cost modeling in a data flow model. In one embodiment, a data flow model is provided that includes multiple data flow objects, each data flow object including one or more data units. Each data flow object represents a respective information asset. Quality and cost components are assigned to each data unit of a first data flow object. A cost property is determined for each data unit of the first data flow object, based on the assigned components. An object cost property is also determined for the first data flow object. The determined cost properties may be used to identify information assets that are more cost-effective to upgrade, relative to other information assets.
    Type: Grant
    Filed: June 16, 2011
    Date of Patent: October 21, 2014
    Assignee: International Business Machines Corporation
    Inventors: David T. Meeks, Michael J. Morton
  • Patent number: 8868484
    Abstract: An aspect of the present invention facilitates efficient updating of rows in a data warehouse. In one embodiment, a sequence of identifiers is allocated to rows of data elements received earlier, and then inserted along with the received data elements in a table according to the defined by the sequence of identifiers. On receiving data elements as updates to the rows having associated identifiers, the received data elements are sorted according to the order of the associated identifiers and updates of the rows are performed successively in the sorted order of identifiers. According to another aspect, the above features are performed when (target) fact/dimension tables in the data warehouse are sought to be updated based on transformations/computations performed on data in other (source) fact/dimension tables.
    Type: Grant
    Filed: July 8, 2010
    Date of Patent: October 21, 2014
    Assignee: Oracle International Corporation
    Inventor: Nitin Aggarwal
  • Publication number: 20140310232
    Abstract: The invention relates to a computer system for both online transaction processing and online analytical processing, comprising: a processor coupled to a database, the database comprising the database comprising: a main store (116) for storing records, a differential buffer (114) for receiving and buffering added or deleted or modified records, the differential buffer being coupled to the main store, a schema comprising records stored in the main store and records stored in the differential buffer, and a cache store (112) for caching a result of a query against the schema; and a cache controller (106) executable by the processor and communicatively coupled to the database, the cache controller being configured for: storing the result of the query in the cache store; receiving an analytical request; and determining, in response to the received request, an up-to-date result of the query by (216): accessing the cache store to obtain the cached result; determining the records of the schema that have been added or
    Type: Application
    Filed: February 10, 2014
    Publication date: October 16, 2014
    Applicant: Hasso-Plattner-Institut für Softwaresystemtechnik GmbH
    Inventors: Hasso Plattner, Stephan Mueller, Jens Krueger, Juergen Mueller, Christian Schwarz
  • Publication number: 20140310233
    Abstract: Systems and methods to provide tracing of data through a transformation process using tracers added to the data during the transformation. In one embodiment, a method includes: receiving a plurality of data records; processing the data records using a plurality of transformation rules, wherein each of the transformation rules corresponds to a respective one of a plurality of tracers; and adding the tracers to the data records as needed to indicate those of the transformation rules used to process each data record. After processing the data records using the transformation rules, the data records are provided to a target system, and the transformation process is validated using the tracers added to the data records.
    Type: Application
    Filed: April 14, 2014
    Publication date: October 16, 2014
    Applicant: Validus Solutions, Inc.
    Inventors: Richard John Catalano, SR., Daryl David Crockett-Grabus
  • Publication number: 20140310231
    Abstract: A system and computer-implemented method for automating data warehousing processes is provided. The system comprises a code generator configured to generate codes for Extract, Transform and Load (ETL) tools, wherein the codes facilitate the ETL tools in extracting, transforming and loading data read from data sources. The system further comprises a code reviewer configured to review and analyze the generated codes. Furthermore, the system comprises a data migration module configured to facilitate migrating the data read from the data sources to one or more data warehouses. Also, the system comprises a data generator configured to mask the data read from the data sources to generate processed data. In addition, the system comprises a Data Warehouse Quality Assurance module configured to facilitate testing the read and the processed data. The system further comprises a reporting module configured to provide status reports on the data warehousing processes.
    Type: Application
    Filed: June 3, 2013
    Publication date: October 16, 2014
    Inventors: Ramkumar Sampathkumaran, Kamalnath Chandrasekaran, Arun Ramkumar
  • Patent number: 8862537
    Abstract: Obfuscating data is disclosed. A processor identifies structured information in log data. The structured information is transformed in a manner that preserves the structure to form transformed raw data. The transformed raw data is sent to a remote analysis engine. The remote analysis engine receives a query and responds to the query by providing as results at least a portion of the transformed raw data. A processor is configured to de-transform the transformed raw data.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: October 14, 2014
    Assignee: Sumo Logic
    Inventors: Bruno Kurtic, Stefan Christoph Zier, Christian Friedrich Beedgen, Kumar Saurabh
  • Publication number: 20140304217
    Abstract: Disclosed are methods and apparatuses for reducing the number of database calls when transferring data from a database to a server. One embodiment provides an approach for utilizing staging tables to collate and transform core data and related data in the database before transferring the data to the server, so that the number of database calls used may be reduced.
    Type: Application
    Filed: April 8, 2014
    Publication date: October 9, 2014
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: David Wayne NOWAKOWSKI, Scott Gene BUSS, John David BILTHUIS, Pradip Kumar PANDEY, George R. POGMORE, John David HOLDER
  • Publication number: 20140304216
    Abstract: Systems and methods are provided for searchable screen sharing sessions. A system records at least one image associated with a screen sharing session. The system extracts content from the at least one image. The system stores the content as searchable content associated with the screen sharing session.
    Type: Application
    Filed: February 14, 2014
    Publication date: October 9, 2014
    Applicant: salesforce.com, Inc.
    Inventors: Anand Shyam Rohra, Igor Morozov
  • Publication number: 20140304215
    Abstract: One embodiment of the present disclosure is a method of creating metadata during object development. The method comprises receiving a change to an object during its development that results in a changed version of the object, identifying information about the change, and creating metadata comprising the information about the change. The information about the change may include a unique instance identifier identifying and unique to the changed version of the object. As an object is changed multiple times during development, the created metadata may include a series of information segments each relating to a particular change and each uniquely identified by its unique instance identifier. The information about the change may also include, as examples, an identification of a unique instance identifier of a prior version of the object, the time of the change to the object, and/or identification of the software used to make the change.
    Type: Application
    Filed: May 29, 2008
    Publication date: October 9, 2014
    Applicant: Adobe Systems Incorporated
    Inventors: Larry Melvin Masinter, Stephen Arnulf Deach, David P. Simons
  • Patent number: 8856064
    Abstract: A method and apparatus for information repository workflows to transfer information between a first domain, such as healthcare sites, and a second domain, such as medical research facilities. Large quantities of medical information may be directly transferred to an information repository or indirectly transferred to the repository through the use of pointers. The information is cleansed and normalized prior to storage in a production database within the repository. The cleansing process is conducted while ensuring integrity of the production database is maintained and while continuing to receive additional information transfers. Errors encountered during processing are logged and reported.
    Type: Grant
    Filed: February 21, 2012
    Date of Patent: October 7, 2014
    Assignee: International Business Machines Corporation
    Inventors: James R. Devries, Saleem Hussain, Cale T. Rath, Saeid Sakhitab
  • Patent number: 8856156
    Abstract: Systems, methods and computer-readable media are provided for facilitating patient health care by providing discovery, validation, and quality assurance of nomenclatural linkages between pairs of terms or combinations of terms in databases extant on multiple different health information systems that do not share a set of unified codesets, nomenclatures, or ontologies, or that may in part rely upon unstructured free-text narrative content instead of codes or standardized tags. Embodiments discover semantic structures existing naturally in documents and records, including relationships of synonymy and polysemy between terms arising from disparate processes, and maintained by different information systems. In some embodiments, this process is facilitated by applying Latent Semantic Analysis in concert with decision-tree induction and similarity metrics.
    Type: Grant
    Filed: October 5, 2012
    Date of Patent: October 7, 2014
    Assignee: Cerner Innovation, Inc.
    Inventors: Douglas S. McNair, John Christopher Murrish, Kanakasabha Kailasam
  • Publication number: 20140297584
    Abstract: A method for context transfer for data storage is disclosed. The method includes receiving, from an analysis environment, a request to load a data item for analysis, creating, based on the request, a task request including context information of the data item, transmitting the task request to a data manager environment, receiving, from the data manager environment, an approval of the task request based on the context information, and loading the data item in a master data store in response to the approval.
    Type: Application
    Filed: March 27, 2014
    Publication date: October 2, 2014
    Applicant: SCHLUMBERGER TECHNOLOGY CORPORATION
    Inventors: Chase Jenkins, Hallgrim Ludvigsen, Todd Olsen, Rene Radhay, Trond Benum
  • Publication number: 20140297583
    Abstract: Techniques, systems, and articles of manufacture for determining statistics for cost-based optimization of a workflow. A method includes generating individual sets of statistics for each intermediate relation of a workflow, wherein said intermediate relations comprise results of stages of any plan of the workflow, and wherein each individual set of statistics computes cardinality of the corresponding intermediate relation; determining a global set of statistics for the workflow, wherein said global set of statistics comprises at least one of the individual sets of statistics for each of the intermediate relations; instrumenting a given plan of the workflow to collect the global set of statistics during execution; executing the given plan to collect the global set of statistics; and determining a plan of the workflow with the lowest cost by comparing the cost of multiple plans, wherein the cost of each plan is derived from the global set of statistics.
    Type: Application
    Filed: March 29, 2013
    Publication date: October 2, 2014
    Applicant: International Business Machines Corporation
    Inventors: Ramanujam S. Halasipuram, Prasad M. Deshpande, Sriram Padmanabhan
  • Patent number: 8849746
    Abstract: An event tap associated with a server, such as a Web server, at a machine can transform a server event into a tuple, select a database node for the tuple, and place the tuple in a queue for that database node, and then flush the queue periodically directly into database nodes. The use of an event tap can thus reduce the computational burden on the database while keeping the server event data in the database relatively fresh.
    Type: Grant
    Filed: December 19, 2006
    Date of Patent: September 30, 2014
    Assignee: Teradata US, Inc.
    Inventors: George Candea, Anastasios Argyros, Mayank Bawa
  • Patent number: 8849745
    Abstract: Improved decision support techniques are disclosed. For example, a technique for processing transactional data for use in accordance with a decision support operation comprises the following steps/operations. The transactional data is transformed into an interim form according to a relative importance of the transactional data to the decision support operation. The transformed data is incrementally analyzed, wherein the incremental analysis comprises applying a successive approximation to the data in the interim form, such that the decision support operation can be incrementally performed based on a transactional load. The technique may further comprise allocating one or more computing resources to incrementally perform the decision support operation.
    Type: Grant
    Filed: December 23, 2005
    Date of Patent: September 30, 2014
    Assignee: International Business Machines Corporation
    Inventors: Murray S. Campbell, Chung-Sheng Li, Jeanette M. Rosenthal
  • Patent number: 8849747
    Abstract: Systems, methods, apparatus, and computer-readable media for business process management include identifying a business process model generated with a business process modeling tool; transforming the business process model defined in a first format to one or more artifacts defined in a second format; deploying an output file defined by the one or more artifacts in an in-memory database; and executing the output file in the in-memory database base at least in part on one or more processes executing in the in-memory database in response to a received query for data stored in the in-memory database.
    Type: Grant
    Filed: April 24, 2012
    Date of Patent: September 30, 2014
    Assignee: SAP AG
    Inventors: Sören Balko, Kesavaprakash Vasudevan
  • Publication number: 20140289183
    Abstract: A system and method of key figure data filters are presented. The key figure data filters are implemented in an analytical engine of a business warehouse system. The key figure data filters employ conditions, which can be expressed as a kind of selection that describe a set. A key figure data algorithm can be implemented by the analytical engine using the conditions, yet still respect hierarchies in the business warehouse database.
    Type: Application
    Filed: March 21, 2013
    Publication date: September 25, 2014
    Inventors: Stefan Dipper, Sebastian Schroetel, Stefan Unnebrink, Michael Wilking
  • Patent number: 8843527
    Abstract: An approach is provided in which a processor receives a scan request to scan data included in a data table. The processor selects a column in the data table corresponding to the scan request and retrieves column data entries from the selected column. In addition, the processor identifies the width of the selected column and selects a scan algorithm based upon the identified column width. In turn, the processor loads the column data entries into column data vectors and computes scan results from the column data vectors using the selected scan algorithm.
    Type: Grant
    Filed: January 21, 2013
    Date of Patent: September 23, 2014
    Assignee: International Business Machines Corporation
    Inventors: Eduard Diner, Jens Leenstra, Vijayshankar Raman, Adam J. Storm
  • Patent number: 8843436
    Abstract: In one embodiment, the present invention includes a computer-implemented method comprising receiving a query and generating first and second requests. The first request is sent to an analytic data store and includes a data transfer indicator representing a time period when the data record was received in the analytic data store. The second request is sent to a transactional model proxy and includes a monotonically changing value, such as a timestamp. The transactional model proxy reads data from a transactional data store based on the value in the request. Data from the analytic and transactional data stores may be aggregated and returned in response to the query.
    Type: Grant
    Filed: February 12, 2010
    Date of Patent: September 23, 2014
    Assignee: SAP AG
    Inventors: Franz Marschall, Stefan Unnebrink
  • Publication number: 20140279832
    Abstract: This disclosure in part describes systems, devices, and methods for implementing a unification mechanism that enables the unification of various data sets, which can permit greater collaboration between users of a data management platform. For example, a system can be configured to enable a user to add or associate a contextual object with a data measure. In associating the contextual object with the data measure, the system can store the contextual object with the unique dimension coordinates that are associated with data measure. Other users of the system may, in turn, access the data measure and be shown the contextual object associated with the data measure.
    Type: Application
    Filed: March 10, 2014
    Publication date: September 18, 2014
    Applicant: DECISYON, INC.
    Inventors: Antonio Vivalda, Fabrizio Del Franco, Simone Costantini, Alessandro Pratico, Franco Petrucci
  • Publication number: 20140279831
    Abstract: Techniques for data modeling are provided. Enterprise data is organized into reference data for entities that an enterprise wants to track and monitor. Relationship data is created that establishes relationships among the various entities within the enterprise data. The reference data and the relationship data are published within an enterprise data warehouse for accessing the enterprise data.
    Type: Application
    Filed: September 27, 2013
    Publication date: September 18, 2014
    Applicant: Teradata US, Inc.
    Inventors: Thomas Kevin Ryan, Anand Louis
  • Publication number: 20140279834
    Abstract: A method of operating a data analysis system includes retrieving objects from a data source. Each of the retrieved objects includes (i) data and (ii) metadata describing the data. The method further includes dynamically creating a cumulative schema by, for each object of the retrieved objects: (i) inferring a schema from the object based on the metadata of the object and inferred data types of elements of the data of the object, (ii) creating a unified schema, wherein the unified schema describes both (a) the object described by the inferred schema and (b) a cumulative set of objects described by the cumulative schema, and (iii) storing the unified schema as the cumulative schema. The method further includes exporting the data of each of the retrieved objects to a data warehouse.
    Type: Application
    Filed: March 14, 2014
    Publication date: September 18, 2014
    Applicant: AMIATO, INC.
    Inventors: Dimitris TSIROGIANNIS, Nathan A. BINKERT, Stavros HARIZOPOULOS, Mehul A. SHAH, Benjamin SOWELL, Bryan D. KAPLAN, Kevin R. MEYER
  • Publication number: 20140279828
    Abstract: A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated of the input data.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
  • Publication number: 20140279833
    Abstract: Various embodiments are directed to techniques for providing one or more reduced-size rule cubes indicating cell rules. A computer-program product embodied in a machine-readable storage medium includes instructions to cause a computing device to select a cell rule to include in a rule cube based on applicability of the cell rule to a selected portion of a data cube; analyze the cell rule to identify a wildcarded dimension in a specification of cells of the data cube that are subject to the cell rule; and generate the rule cube indicating applicability of the cell rule to the selected portion of the data cube, wherein a cell of the rule cube corresponds to multiple cells of the data cube, and wherein the wildcarded dimension of the rule cube is reduced in length in comparison to a length of the wildcarded dimension of the data cube. Other embodiments are described and claimed.
    Type: Application
    Filed: March 13, 2014
    Publication date: September 18, 2014
    Applicant: SAS INSTITUTE INC.
    Inventors: Qing Gong, Amy Spelman Davis, Ryan Norris, Enrico Stevanoni
  • Publication number: 20140279827
    Abstract: A method, system, and computer program product for managing time dimension tables are disclosed. In a first aspect, the method comprises providing an intelligent time dimension agent application and optimizing a time dimension table by the intelligent time dimension agent based upon changing data in associated fact tables. In a second aspect, the system comprises a processor and a memory coupled to the processor, wherein the memory includes an application which, when executed by the processor, causes the processor to carry out the steps of the method. In a third aspect, the computer program product comprises a non-transitory computer readable medium having a computer readable program recorded thereon, wherein the computer readable program, when executed on a computer device, causes the computer device to carry out the steps of the method.
    Type: Application
    Filed: March 12, 2013
    Publication date: September 18, 2014
    Applicant: International Business Machines Corporation
    Inventors: Joseph P. Bigus, Leon Gong
  • Publication number: 20140279829
    Abstract: A method includes receiving data from a non-geo enabled data source, obtaining information related to location in the received data, converting the obtained information to a standardized form of geo-location data, and storing the geo-location data.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Applicant: SAP AG
    Inventor: Manfred Reinart
  • Publication number: 20140279826
    Abstract: A data feeds platform, comprising a data feed using a data feed protocol. The data feed protocol comprises, a feed header defining at least one identification attribute, a data header defining a data structure comprising data nodes, each data node corresponding to a subset of the data, and a data section, the data section storing the data according to the data structure. A plurality of heterogeneous data protocols is stored. A data input is received in one of the plurality of heterogeneous data protocols. The data input is converted to the data feed protocol, creating the data feed.
    Type: Application
    Filed: March 12, 2013
    Publication date: September 18, 2014
    Applicant: MORGAN STANLEY
    Inventors: Bhavin P. Dave, Amit S. Modi
  • Publication number: 20140279830
    Abstract: Approaches for data integration between multiple IT sources using automated data processing based on target metadata are provided. Specifically, an integration component is configured to load a mapped data set into a table with delta processing based on a configuration object containing, e.g., the source data location, target table name, and source to target mapping. The integration component uses the target metadata to validate data, identify changes, generate the necessary database programming language (e.g., structured query language (SQL)), and run the database programming language with data binding to perform the actual data updates. The integration component leverages the data target metadata to automate the processing of source data, thus providing a way to validate the data, and identify delta changes at the field level between the source and target. This significantly reduces the overall development effort, while providing consistency in record handling and error reporting.
    Type: Application
    Filed: March 15, 2013
    Publication date: September 18, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gautam Majumdar, Tarun K. Saha, Michael Q. Wang
  • Publication number: 20140279835
    Abstract: Techniques are disclosed to determine data quality issues in data processing jobs. The data processing job is received, the data processing job specifying one or more processing steps designed based on one or more data schemas and further specifies one or more desired quality metrics to measure at the one or more processing steps. One or more state machines are provided, that are generated based on the quality metrics and on the data schemas. Input data to the data process job are processed using the one or more state machines, in order to generate output data and a set of data quality records characterizing a set of data quality issues identified during the execution of the data processing job.
    Type: Application
    Filed: March 25, 2014
    Publication date: September 18, 2014
    Applicant: International Business Machines Corporation
    Inventors: Jeff J. LI, Yong LI
  • Patent number: 8839124
    Abstract: An apparatus for configuring a diagnostics module for an industrial plant includes an application module, which is configured to run on a computer and includes a graphics builder. The graphics builder includes a workspace configured to present a display area on a monitor of the computer, the display area configured to be shown by the diagnostics module on a client machine, and graphics tools configured to be manipulated via at least one user input device of the computer for customization of the display area.
    Type: Grant
    Filed: August 3, 2012
    Date of Patent: September 16, 2014
    Assignee: Yokogawa Electric Corporation
    Inventor: Ryan Justiza
  • Patent number: 8838525
    Abstract: In one embodiment, a method for communicating data is provided. The method includes defining a data transmission unit (DTU) to be sent in an xDSL data stream, defining a retransmit container as a time slot that corresponds to the DTU, maintaining, in a retransmit container, a copy of the DTU and an index corresponding to the retransmit container, the retransmit container being maintained at the second device independently of the first device, transmitting the DTU in the xDSL data stream, determining whether the DTU should be retransmitted, and if the DTU should be retransmitted: identifying the DTU by the retransmit container and retransmitting a copy of the DTU as identified by the retransmit container when the DTU reaches a predetermined position in the retransmission buffer.
    Type: Grant
    Filed: June 21, 2012
    Date of Patent: September 16, 2014
    Assignee: Broadcom Corporation
    Inventors: Benoit Christiaens, Miguel Peeters, Raphael Cassiers
  • Patent number: 8838626
    Abstract: Embodiments of techniques and systems for parallel XML parsing are described. An event-level XML parser may include a lightweight events partitioning stage, parallel events parsing stages, and a post-processing stage. The events partition may pick out event boundaries using single-instruction, multiple-data instructions to find occurrences of the “<” character, marking event boundaries. Subsequent checking may be performed to help identify other event boundaries, as well as non-boundary instances of the “<” character. During events parsing, unresolved items, such as namespace resolution or matching of start and end elements, may be recorded in structure metadata. This structure metadata may be used during the subsequent post-processing to perform a check of the XML data. If the XML data is well-formed, individual sub-event streams formed by the events parsing processes may be assembled into a flat result event stream structure. Other embodiments may be described and claimed.
    Type: Grant
    Filed: December 17, 2009
    Date of Patent: September 16, 2014
    Assignee: Intel Corporation
    Inventors: Zhiqiang Yu, Yuejian Fang, Lei Zhai, Yun Wang, Zhonghai Wu, Mo Dai
  • Publication number: 20140258209
    Abstract: Systems and methods for interest-driven business intelligence systems in accordance with embodiments of the invention are illustrated. In one embodiment, an interest-driven distributed data server system includes a processor, an interest-driven distributed data server application, aggregate data storage, and a data source directory, wherein the interest-driven distributed data server application configures the processor to obtain reporting data requirements, determine at least one data dimension using the reporting data requirements, identify at least one data source identified that can provide data associated with the data dimension, generate data source job data, transmit the data source job data to at least one data source, and receive job response data, store the received job response data in the aggregate data storage, build reporting data based on the reporting data requirements, and transmit the reporting data.
    Type: Application
    Filed: March 8, 2013
    Publication date: September 11, 2014
    Applicant: Platfora, Inc.
    Inventors: John Glenn Eshleman, Kevin Scott Beyer, Brian Babcock, Carl-Christian Kanne
  • Publication number: 20140258210
    Abstract: The present subject matter relates to generation of business intelligence reports with navigable reference indicators. The method comprises obtaining information from an information source for providing in a business intelligence report. Further, report parameters are obtained through a user interface, wherein the report parameters include at least notes and indicator parameters. The notes and indicator parameters indicate a plurality of navigable reference indicators to be included in the business intelligence report. The method further comprises mapping the notes and indicator parameters with the obtained information. Based on the mapping, the business intelligence report is generated with the plurality of navigable reference indicators. Further, the plurality of navigable reference indicators provides a forward and backward link between two or more sections of the business intelligence report.
    Type: Application
    Filed: February 18, 2014
    Publication date: September 11, 2014
    Applicant: Tata Consultancy Services Limited
    Inventor: Malukanparampil Meer Mohammed Riyaz
  • Patent number: 8832020
    Abstract: A computer system architecture and process transforms the organizing principles of business data management seen in traditional or common place settings across disparate data stores into an object-semantic model. The transformation is implemented in a transitory, managed risk implementation approach for data access migration. This approach will enable data agility and will move data storage closer to information storage, while supporting the rapid development of business capabilities. This approach will bridge the gaps between relational database/file based storage (less expressive) and object-semantic representation (more expressive). This process/method will connect information residing in legacy data structures such as relational database/file based storage on disk drives to instances populating an expressive object-semantic model in persistent grid based memory.
    Type: Grant
    Filed: November 15, 2011
    Date of Patent: September 9, 2014
    Assignee: Medco Health Solutions, Inc.
    Inventors: Mark Joseph Landy, Jimmy Chou, Tej Anand, Tanya Kaplun, Sarah Mord Myers, Peter Foreman
  • Patent number: 8832016
    Abstract: An approach is provided for providing recommendation services while protecting the privacy of personal user information. The approach involves determining a request for at least one recommendation, the request specifying at least in part an anonymized user preference record. The approach also involves causing, at least in part, a comparison of the anonymized user preference record against one or more previously stored anonymized user preference records. The approach further involves causing, at least in part, a selection of the anonymized user preference record, the one or more previously stored anonymized user preference records, or a combination thereof based, at least in part, on the comparison. The approach additionally involves determining to generate the at least one recommendation based, at least in part, on the selection.
    Type: Grant
    Filed: December 9, 2011
    Date of Patent: September 9, 2014
    Assignee: Nokia Corporation
    Inventor: Jonathan Ledlie
  • Patent number: 8832158
    Abstract: An approach is provided in which a processor receives a scan request to scan data included in a data table. The processor selects a column in the data table corresponding to the scan request and retrieves column data entries from the selected column. In addition, the processor identifies the width of the selected column and selects a scan algorithm based upon the identified column width. In turn, the processor loads the column data entries into column data vectors and computes scan results from the column data vectors using the selected scan algorithm.
    Type: Grant
    Filed: March 29, 2012
    Date of Patent: September 9, 2014
    Assignee: International Business Machines Corporation
    Inventors: Eduard Diner, Jens Leenstra, Vijayshankar Raman, Adam J. Storm
  • Patent number: 8832601
    Abstract: A method and apparatus for an ETL tool utilizing dimension trees. The method may include identifying various dimensions associated with one or more databases, and creating dimension trees based on these dimensions. The method may further include creating an ETL pyramid having multiple levels, linking the dimension trees to corresponding levels of the ETL pyramid, and creating a table for each level of the ETL pyramid to store a subset of the dimension trees that is linked to a corresponding level of the ETL pyramid.
    Type: Grant
    Filed: May 31, 2008
    Date of Patent: September 9, 2014
    Assignee: Red Hat, Inc.
    Inventor: Eric J. Williamson
  • Publication number: 20140250054
    Abstract: A data management system for calculating and retrieving analytic data involves a computing device coupled to a database. The system includes a measure data structure within the database including a measure-data element, a collection data structure within the database including a collection-data element, and a fact data structure within the database, and configured to store a fact-data element which includes an aggregated value, a link to the measure-data element, and a link to the collection-data element. The system also includes a fact-analyzer module configured to generate the aggregated value from the transaction data utilizing at least the measure-data element and the collection-data element, wherein the measure-data element defines how the aggregated value is calculated and the collection-data element defines a categorization of the aggregated value.
    Type: Application
    Filed: March 4, 2013
    Publication date: September 4, 2014
    Applicant: MASTERCARD INTERNATIONAL INCORPORATED
    Inventor: William C. Jones
  • Patent number: 8825592
    Abstract: A computer-implemented method of extracting data from a document in an electronic format. The method includes the steps of accessing a file in an electronic format from a memory module; extracting data from the file corresponding to a plurality of keys contained within a mapping structure stored in the memory module; organizing the extracted data into values, wherein each value maps to one of the plurality of keys to form a hash map; storing the hash map in a database; and providing a user access to the database via an output device. The output device allows the user to view a customizable document whose content is derived from the values and keys stored in the database.
    Type: Grant
    Filed: March 10, 2009
    Date of Patent: September 2, 2014
    Assignee: Web Access, Inc.
    Inventors: Stephen A. Lobo, Claudian Fernandes
  • Patent number: 8825628
    Abstract: The disclosure presents a method, system and computer-readable medium related to automatically analyzing structure for a web page. The method embodiment comprises building a training corpus comprising a broad stylistic coverage of web pages, segmenting a web page into information blocks, identifying semantic categories of the information blocks using the training corpus and applying the identical semantic categories in a web-based tool.
    Type: Grant
    Filed: August 18, 2010
    Date of Patent: September 2, 2014
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Junlan Feng, Barbara B. Hollister
  • Patent number: 8826125
    Abstract: A news reader is disclosed which selects the optimal articles for presentation to the user and which facilitates a sense of community among users. Upon receiving an article from a news provider, the news reader analyzes characteristics associated with the article. Amongst other things, the characteristics may indicate whether the article represents a breaking news story or whether the article includes advertisements. After analyzing the characteristics of the article, the news reader determines whether the article is to be designated a primary/presentable article and aggregates the article with other primary/presentable articles related to the same topic.
    Type: Grant
    Filed: March 12, 2012
    Date of Patent: September 2, 2014
    Assignee: Hyperion Media LLC
    Inventor: Nelson Chu
  • Patent number: 8826277
    Abstract: High level characteristics are extracted from a virtual machine provisioning request stream for a cloud computing environment. A plurality of prediction models are generated for a plurality of different types of virtual machines. A regression-correlation ensemble is applied to appropriate ones of the prediction models to predict future virtual machine demand for the cloud computing environment.
    Type: Grant
    Filed: November 29, 2011
    Date of Patent: September 2, 2014
    Assignee: International Business Machines Corporation
    Inventors: Rong N. Chang, Mihwa Choi, Yexi Jiang, Meir Laker, Chang-Shing Perng, Hidayatullah H. Shaikh, Edward C. So, Tao Tao
  • Patent number: 8825593
    Abstract: A system for aggregating data from a plurality of operational databases, and a method for providing the same, are provided. The system includes a data store storing a collection of pre-defined extraction scripts. The extraction scripts identify data available for extraction from a plurality of operational database products, and including parameters for specifying which of the data to extract. A data warehouse server executes the extraction scripts to extract, merge, transform and load the specified data from the plurality of operational databases into a multidimensional database.
    Type: Grant
    Filed: March 26, 2010
    Date of Patent: September 2, 2014
    Assignee: Trapeze Software ULC
    Inventors: Matthew Dodds, Chacko Kattithara Paul
  • Patent number: 8825591
    Abstract: System and method for tracking statistics at the subfile level and transparently placing or migrating inactive or less active blocks of data to other storage devices. Embodiments may provide mechanisms to track statistics at the subfile level of files including, but not limited to, database files, and to transparently place or migrate inactive or less active blocks of data of the files from higher-performing, typically more expensive, storage to lower-performing, typically less expensive, storage, while placing or migrating active blocks of data of the files to higher-performing storage, based on the subfile-level statistics rather than on file-level timestamps. In some embodiments, knowledge of file structure (e.g., database file structure), for example knowledge of database partitions with header blocks and data blocks in databases using data partitioning, may be used to separate more active and less active blocks of data of files onto storage with different performance and/or other characteristics.
    Type: Grant
    Filed: December 31, 2003
    Date of Patent: September 2, 2014
    Assignee: Symantec Operating Corporation
    Inventors: Xinyi David Lai, Par Botes, HanCheng Hsiung
  • Publication number: 20140244570
    Abstract: Disclosed herein are techniques for optimizing and managing the execution of hybrid flows. An execution plan is generated for each hybrid flow based at least partially on attributes associated therewith. The execution of each hybrid flow is managed in accordance with the execution plan.
    Type: Application
    Filed: February 28, 2013
    Publication date: August 28, 2014
    Applicant: Hewlett-Packard Development Company, L.P.
    Inventors: Alkiviadis Simitsis, William K. Wilkinson, Umeshwar Dayal
  • Publication number: 20140236884
    Abstract: Embodiments of systems, methods, and computer-readable mediums for analysis of datasets without predefined dimensions are generally described herein. In some embodiments, analysis of datasets without predefined dimensions may include receiving a selection of multiple dimensions of information from a database to be used for data analysis; receiving a selection for a type of report; and dynamically generating a query statement based on the selection of the multiple dimensions. Further embodiments may include the dynamically generated query statement including multiple iterative join clauses as a function of the selected dimensions; executing the query statement against an in-memory database; and displaying columns of information obtained from execution of the query statement.
    Type: Application
    Filed: February 15, 2013
    Publication date: August 21, 2014
    Applicant: SAP AG
    Inventors: Mitchell Clark, Celso da Silveira, Julian Ogando
  • Patent number: 8812449
    Abstract: A storage system having a plurality of storage devices including a first type storage device and a second type storage device, a reliability attribute and/or a performance attribute of the first type storage device being different from a reliability attribute and/or a performance attribute of the second type storage device. The storage system also has a control unit and managing a plurality of virtual volumes. If necessary, a storage area allocated to a first portion of a virtual volume of the plurality of virtual volumes is changed from a first type storage area of the plurality of first type storage areas to a second type storage area of the plurality of second type storage areas while another first type storage area of the plurality of first type storage areas is allocated to a second portion of the virtual volume.
    Type: Grant
    Filed: September 10, 2012
    Date of Patent: August 19, 2014
    Assignee: Hitachi, Ltd.
    Inventors: Teiko Kezuka, Akira Murotani, Seiichi Higaki
  • Patent number: 8812433
    Abstract: Multi-dimensional surrogation systems and methods are provided that generate at least one up-to-date data surrogate using information of data and numerous data changes received from at least one data source, including a production system. Embodiments described further perform bulk-to-brick transformation of the up-to-date data surrogate. Brick-level data, or item data, is further indexed and stored in an application-aware manner. The stored item data is easily accessible by many data management applications for integrated item search and recovery functions, audit functions, monitoring and supervision functions, legal discovery functions, compliance functions, archival functions, backup functions, disaster recovery functions, and more. Bulk-to-brick transformation and access of the stored item data occur off of the production system, thus contributing no performance degradation to the production system.
    Type: Grant
    Filed: October 2, 2006
    Date of Patent: August 19, 2014
    Assignee: Mimosa Systems, Inc.
    Inventors: Roy P. D'Souza, Thirumalai Muppur Ravi