Data Extraction, Transformation, and Loading (ETL) Patents (Class 707/602)
-
Patent number: 8868370
Abstract: The present invention is a sample analyzing system, including a sample analyzer and a management apparatus connected to the sample analyzer via a communication network. The management apparatus includes: a first memory that stores a computer program for the sample analyzer and manual data which corresponds to a version of the computer program; a first communication device; and a first controller configured to transmit, via the first communication device to the sample analyzer, the computer program and the manual data corresponding to the version of the computer program stored in the first memory. The sample analyzer includes: a second communication device; a second memory that stores the computer program and the manual data received by the second communication device; and a second controller configured to execute the computer program stored in the second memory.
Type: Grant
Filed: September 22, 2011
Date of Patent: October 21, 2014
Assignee: Sysmex Corporation
Inventors: Naoki Shindo, Yusuke Suga, Aya Konishi, Daigo Fukuma, Keisuke Kuwano
-
Patent number: 8868485
Abstract: Techniques are disclosed for cost modeling in a data flow model. In one embodiment, a data flow model is provided that includes multiple data flow objects, each data flow object including one or more data units. Each data flow object represents a respective information asset. Quality and cost components are assigned to each data unit of a first data flow object. A cost property is determined for each data unit of the first data flow object, based on the assigned components. An object cost property is also determined for the first data flow object. The determined cost properties may be used to identify information assets that are more cost-effective to upgrade, relative to other information assets.
Type: Grant
Filed: June 16, 2011
Date of Patent: October 21, 2014
Assignee: International Business Machines Corporation
Inventors: David T. Meeks, Michael J. Morton
-
Patent number: 8868484
Abstract: An aspect of the present invention facilitates efficient updating of rows in a data warehouse. In one embodiment, a sequence of identifiers is allocated to rows of data elements received earlier, and then inserted along with the received data elements in a table according to the order defined by the sequence of identifiers. On receiving data elements as updates to the rows having associated identifiers, the received data elements are sorted according to the order of the associated identifiers and updates of the rows are performed successively in the sorted order of identifiers. According to another aspect, the above features are performed when (target) fact/dimension tables in the data warehouse are sought to be updated based on transformations/computations performed on data in other (source) fact/dimension tables.
Type: Grant
Filed: July 8, 2010
Date of Patent: October 21, 2014
Assignee: Oracle International Corporation
Inventor: Nitin Aggarwal
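The update ordering described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the in-memory list standing in for a warehouse table and the names `insert_rows` and `apply_updates` are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

# Hypothetical in-memory stand-in for a warehouse table: (identifier, row) pairs.
Table = List[Tuple[int, dict]]


def insert_rows(table: Table, rows: List[dict], next_id: int) -> int:
    """Allocate sequential identifiers and append rows in identifier order."""
    for row in rows:
        table.append((next_id, dict(row)))
        next_id += 1
    return next_id


def apply_updates(table: Table, updates: Dict[int, dict]) -> List[int]:
    """Sort incoming updates by identifier and apply them in that order.

    Returns the identifiers in the order they were visited.
    """
    position = {row_id: i for i, (row_id, _) in enumerate(table)}
    visited = []
    for row_id in sorted(updates):
        table[position[row_id]][1].update(updates[row_id])
        visited.append(row_id)
    return visited
```

Because rows were inserted in identifier order, applying updates in sorted identifier order walks the table in one direction rather than jumping between positions.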
-
Publication number: 20140310232
Abstract: The invention relates to a computer system for both online transaction processing and online analytical processing, comprising: a processor coupled to a database, the database comprising: a main store (116) for storing records, a differential buffer (114) for receiving and buffering added or deleted or modified records, the differential buffer being coupled to the main store, a schema comprising records stored in the main store and records stored in the differential buffer, and a cache store (112) for caching a result of a query against the schema; and a cache controller (106) executable by the processor and communicatively coupled to the database, the cache controller being configured for: storing the result of the query in the cache store; receiving an analytical request; and determining, in response to the received request, an up-to-date result of the query by (216): accessing the cache store to obtain the cached result; determining the records of the schema that have been added or
Type: Application
Filed: February 10, 2014
Publication date: October 16, 2014
Applicant: Hasso-Plattner-Institut für Softwaresystemtechnik GmbH
Inventors: Hasso Plattner, Stephan Mueller, Jens Krueger, Juergen Mueller, Christian Schwarz
-
Publication number: 20140310233
Abstract: Systems and methods to provide tracing of data through a transformation process using tracers added to the data during the transformation. In one embodiment, a method includes: receiving a plurality of data records; processing the data records using a plurality of transformation rules, wherein each of the transformation rules corresponds to a respective one of a plurality of tracers; and adding the tracers to the data records as needed to indicate those of the transformation rules used to process each data record. After processing the data records using the transformation rules, the data records are provided to a target system, and the transformation process is validated using the tracers added to the data records.
Type: Application
Filed: April 14, 2014
Publication date: October 16, 2014
Applicant: Validus Solutions, Inc.
Inventors: Richard John Catalano, SR., Daryl David Crockett-Grabus
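One way to picture the tracer idea is the sketch below. It is only an illustration under stated assumptions: the rule tuples, the `_tracers` field, the tracer names, and the `validate` helper are all hypothetical and do not come from the application.

```python
from typing import Callable, Dict, List, Tuple

# Each hypothetical rule: (tracer name, applies-to predicate, transformation).
Rule = Tuple[str, Callable[[dict], bool], Callable[[dict], dict]]

RULES: List[Rule] = [
    ("T_UPPER", lambda r: "name" in r, lambda r: {**r, "name": r["name"].upper()}),
    ("T_CENTS", lambda r: "amount" in r,
     lambda r: {**r, "amount": int(round(r["amount"] * 100))}),
]


def transform_with_tracers(records: List[dict], rules: List[Rule]) -> List[dict]:
    """Apply each matching rule and record its tracer on the output record."""
    out = []
    for record in records:
        current = dict(record)
        tracers = []
        for tracer, applies, transform in rules:
            if applies(current):
                current = transform(current)
                tracers.append(tracer)
        current["_tracers"] = tracers  # assumed field name for the tracer list
        out.append(current)
    return out


def validate(records: List[dict], expected: Dict[int, List[str]]) -> bool:
    """Check that every record carries exactly the tracers expected for its id."""
    return all(r["_tracers"] == expected[r["id"]] for r in records)
```

After loading, the target side can call `validate` with the tracer list each record was expected to receive, which turns "was rule X applied to record Y" into a direct lookup.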
-
Publication number: 20140310231
Abstract: A system and computer-implemented method for automating data warehousing processes is provided. The system comprises a code generator configured to generate codes for Extract, Transform and Load (ETL) tools, wherein the codes facilitate the ETL tools in extracting, transforming and loading data read from data sources. The system further comprises a code reviewer configured to review and analyze the generated codes. Furthermore, the system comprises a data migration module configured to facilitate migrating the data read from the data sources to one or more data warehouses. Also, the system comprises a data generator configured to mask the data read from the data sources to generate processed data. In addition, the system comprises a Data Warehouse Quality Assurance module configured to facilitate testing the read and the processed data. The system further comprises a reporting module configured to provide status reports on the data warehousing processes.
Type: Application
Filed: June 3, 2013
Publication date: October 16, 2014
Inventors: Ramkumar Sampathkumaran, Kamalnath Chandrasekaran, Arun Ramkumar
-
Patent number: 8862537
Abstract: Obfuscating data is disclosed. A processor identifies structured information in log data. The structured information is transformed in a manner that preserves the structure to form transformed raw data. The transformed raw data is sent to a remote analysis engine. The remote analysis engine receives a query and responds to the query by providing as results at least a portion of the transformed raw data. A processor is configured to de-transform the transformed raw data.
Type: Grant
Filed: June 30, 2011
Date of Patent: October 14, 2014
Assignee: Sumo Logic
Inventors: Bruno Kurtic, Stefan Christoph Zier, Christian Friedrich Beedgen, Kumar Saurabh
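A structure-preserving, reversible transformation of log data might look like the following sketch. The choice of IPv4 addresses as the structured information, the lookup-table approach, and the `Obfuscator` class are assumptions for illustration; the patent abstract does not specify any of them.

```python
import re

# Assumed example of "structured information": dotted IPv4 addresses.
IPV4 = re.compile(r"\b\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}\b")


class Obfuscator:
    """Replace each IPv4 address with a surrogate address, reversibly."""

    def __init__(self) -> None:
        self.forward = {}  # original address -> surrogate
        self.reverse = {}  # surrogate -> original address

    def _surrogate(self, ip: str) -> str:
        if ip not in self.forward:
            n = len(self.forward) + 1
            token = f"10.{(n >> 16) & 255}.{(n >> 8) & 255}.{n & 255}"
            self.forward[ip] = token
            self.reverse[token] = ip
        return self.forward[ip]

    def transform(self, line: str) -> str:
        """Obfuscate every address; the result still parses as IPv4 text."""
        return IPV4.sub(lambda m: self._surrogate(m.group(0)), line)

    def detransform(self, line: str) -> str:
        """Map surrogates in query results back to the original addresses."""
        return IPV4.sub(lambda m: self.reverse.get(m.group(0), m.group(0)), line)
```

Because the surrogate is itself a well-formed address, a remote engine can still parse, group, and count by that field without seeing the original values; only the holder of the mapping can de-transform the results.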
-
Publication number: 20140304217
Abstract: Disclosed are methods and apparatuses for reducing the number of database calls when transferring data from a database to a server. One embodiment provides an approach for utilizing staging tables to collate and transform core data and related data in the database before transferring the data to the server, so that the number of database calls used may be reduced.
Type: Application
Filed: April 8, 2014
Publication date: October 9, 2014
Applicant: ORACLE INTERNATIONAL CORPORATION
Inventors: David Wayne NOWAKOWSKI, Scott Gene BUSS, John David BILTHUIS, Pradip Kumar PANDEY, George R. POGMORE, John David HOLDER
-
Publication number: 20140304216
Abstract: Systems and methods are provided for searchable screen sharing sessions. A system records at least one image associated with a screen sharing session. The system extracts content from the at least one image. The system stores the content as searchable content associated with the screen sharing session.
Type: Application
Filed: February 14, 2014
Publication date: October 9, 2014
Applicant: salesforce.com, Inc.
Inventors: Anand Shyam Rohra, Igor Morozov
-
Publication number: 20140304215
Abstract: One embodiment of the present disclosure is a method of creating metadata during object development. The method comprises receiving a change to an object during its development that results in a changed version of the object, identifying information about the change, and creating metadata comprising the information about the change. The information about the change may include a unique instance identifier identifying and unique to the changed version of the object. As an object is changed multiple times during development, the created metadata may include a series of information segments each relating to a particular change and each uniquely identified by its unique instance identifier. The information about the change may also include, as examples, an identification of a unique instance identifier of a prior version of the object, the time of the change to the object, and/or identification of the software used to make the change.
Type: Application
Filed: May 29, 2008
Publication date: October 9, 2014
Applicant: Adobe Systems Incorporated
Inventors: Larry Melvin Masinter, Stephen Arnulf Deach, David P. Simons
-
Patent number: 8856064
Abstract: A method and apparatus for information repository workflows to transfer information between a first domain, such as healthcare sites, and a second domain, such as medical research facilities. Large quantities of medical information may be directly transferred to an information repository or indirectly transferred to the repository through the use of pointers. The information is cleansed and normalized prior to storage in a production database within the repository. The cleansing process is conducted while ensuring integrity of the production database is maintained and while continuing to receive additional information transfers. Errors encountered during processing are logged and reported.
Type: Grant
Filed: February 21, 2012
Date of Patent: October 7, 2014
Assignee: International Business Machines Corporation
Inventors: James R. Devries, Saleem Hussain, Cale T. Rath, Saeid Sakhitab
-
Patent number: 8856156
Abstract: Systems, methods and computer-readable media are provided for facilitating patient health care by providing discovery, validation, and quality assurance of nomenclatural linkages between pairs of terms or combinations of terms in databases extant on multiple different health information systems that do not share a set of unified codesets, nomenclatures, or ontologies, or that may in part rely upon unstructured free-text narrative content instead of codes or standardized tags. Embodiments discover semantic structures existing naturally in documents and records, including relationships of synonymy and polysemy between terms arising from disparate processes, and maintained by different information systems. In some embodiments, this process is facilitated by applying Latent Semantic Analysis in concert with decision-tree induction and similarity metrics.
Type: Grant
Filed: October 5, 2012
Date of Patent: October 7, 2014
Assignee: Cerner Innovation, Inc.
Inventors: Douglas S. McNair, John Christopher Murrish, Kanakasabha Kailasam
-
Publication number: 20140297584
Abstract: A method for context transfer for data storage is disclosed. The method includes receiving, from an analysis environment, a request to load a data item for analysis, creating, based on the request, a task request including context information of the data item, transmitting the task request to a data manager environment, receiving, from the data manager environment, an approval of the task request based on the context information, and loading the data item in a master data store in response to the approval.
Type: Application
Filed: March 27, 2014
Publication date: October 2, 2014
Applicant: SCHLUMBERGER TECHNOLOGY CORPORATION
Inventors: Chase Jenkins, Hallgrim Ludvigsen, Todd Olsen, Rene Radhay, Trond Benum
-
Publication number: 20140297583
Abstract: Techniques, systems, and articles of manufacture for determining statistics for cost-based optimization of a workflow. A method includes generating individual sets of statistics for each intermediate relation of a workflow, wherein said intermediate relations comprise results of stages of any plan of the workflow, and wherein each individual set of statistics computes cardinality of the corresponding intermediate relation; determining a global set of statistics for the workflow, wherein said global set of statistics comprises at least one of the individual sets of statistics for each of the intermediate relations; instrumenting a given plan of the workflow to collect the global set of statistics during execution; executing the given plan to collect the global set of statistics; and determining a plan of the workflow with the lowest cost by comparing the cost of multiple plans, wherein the cost of each plan is derived from the global set of statistics.
Type: Application
Filed: March 29, 2013
Publication date: October 2, 2014
Applicant: International Business Machines Corporation
Inventors: Ramanujam S. Halasipuram, Prasad M. Deshpande, Sriram Padmanabhan
-
Patent number: 8849746
Abstract: An event tap associated with a server, such as a Web server, at a machine can transform a server event into a tuple, select a database node for the tuple, and place the tuple in a queue for that database node, and then flush the queue periodically directly into database nodes. The use of an event tap can thus reduce the computational burden on the database while keeping the server event data in the database relatively fresh.
Type: Grant
Filed: December 19, 2006
Date of Patent: September 30, 2014
Assignee: Teradata US, Inc.
Inventors: George Candea, Anastasios Argyros, Mayank Bawa
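The transform, route, queue, and flush steps named in this abstract can be sketched as follows. Everything concrete here is an assumption for illustration: the event fields, the CRC-based node selection, the `EventTap` class, and the `sink` callback that stands in for a bulk load into a database node.

```python
import zlib
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

Row = Tuple[int, str, int]


class EventTap:
    """Buffer server events per database node and flush them in batches."""

    def __init__(self, node_count: int, sink: Callable[[int, List[Row]], None]):
        self.node_count = node_count
        self.sink = sink  # hypothetical loader: sink(node_id, rows)
        self.queues: Dict[int, List[Row]] = defaultdict(list)

    def on_event(self, event: dict) -> int:
        """Turn an event into a tuple, pick a node, and enqueue the tuple."""
        row = (event["ts"], event["path"], event["status"])
        # Assumed routing rule: stable hash of the request path.
        node = zlib.crc32(event["path"].encode("utf-8")) % self.node_count
        self.queues[node].append(row)
        return node

    def flush(self) -> int:
        """Send each non-empty queue to its node; return rows flushed."""
        flushed = 0
        for node, rows in list(self.queues.items()):
            if rows:
                self.sink(node, rows)
                flushed += len(rows)
        self.queues.clear()
        return flushed
```

In a real deployment `flush` would be called on a timer; the interval trades database load against data freshness, which is the balance the abstract describes.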
-
Patent number: 8849745
Abstract: Improved decision support techniques are disclosed. For example, a technique for processing transactional data for use in accordance with a decision support operation comprises the following steps/operations. The transactional data is transformed into an interim form according to a relative importance of the transactional data to the decision support operation. The transformed data is incrementally analyzed, wherein the incremental analysis comprises applying a successive approximation to the data in the interim form, such that the decision support operation can be incrementally performed based on a transactional load. The technique may further comprise allocating one or more computing resources to incrementally perform the decision support operation.
Type: Grant
Filed: December 23, 2005
Date of Patent: September 30, 2014
Assignee: International Business Machines Corporation
Inventors: Murray S. Campbell, Chung-Sheng Li, Jeanette M. Rosenthal
-
Patent number: 8849747
Abstract: Systems, methods, apparatus, and computer-readable media for business process management include identifying a business process model generated with a business process modeling tool; transforming the business process model defined in a first format to one or more artifacts defined in a second format; deploying an output file defined by the one or more artifacts in an in-memory database; and executing the output file in the in-memory database based at least in part on one or more processes executing in the in-memory database in response to a received query for data stored in the in-memory database.
Type: Grant
Filed: April 24, 2012
Date of Patent: September 30, 2014
Assignee: SAP AG
Inventors: Sören Balko, Kesavaprakash Vasudevan
-
Publication number: 20140289183
Abstract: A system and method of key figure data filters are presented. The key figure data filters are implemented in an analytical engine of a business warehouse system. The key figure data filters employ conditions, which can be expressed as a kind of selection that describe a set. A key figure data algorithm can be implemented by the analytical engine using the conditions, yet still respect hierarchies in the business warehouse database.
Type: Application
Filed: March 21, 2013
Publication date: September 25, 2014
Inventors: Stefan Dipper, Sebastian Schroetel, Stefan Unnebrink, Michael Wilking
-
Patent number: 8843527
Abstract: An approach is provided in which a processor receives a scan request to scan data included in a data table. The processor selects a column in the data table corresponding to the scan request and retrieves column data entries from the selected column. In addition, the processor identifies the width of the selected column and selects a scan algorithm based upon the identified column width. In turn, the processor loads the column data entries into column data vectors and computes scan results from the column data vectors using the selected scan algorithm.
Type: Grant
Filed: January 21, 2013
Date of Patent: September 23, 2014
Assignee: International Business Machines Corporation
Inventors: Eduard Diner, Jens Leenstra, Vijayshankar Raman, Adam J. Storm
-
Patent number: 8843436
Abstract: In one embodiment, the present invention includes a computer-implemented method comprising receiving a query and generating first and second requests. The first request is sent to an analytic data store and includes a data transfer indicator representing a time period when the data record was received in the analytic data store. The second request is sent to a transactional model proxy and includes a monotonically changing value, such as a timestamp. The transactional model proxy reads data from a transactional data store based on the value in the request. Data from the analytic and transactional data stores may be aggregated and returned in response to the query.
Type: Grant
Filed: February 12, 2010
Date of Patent: September 23, 2014
Assignee: SAP AG
Inventors: Franz Marschall, Stefan Unnebrink
-
Publication number: 20140279832
Abstract: This disclosure in part describes systems, devices, and methods for implementing a unification mechanism that enables the unification of various data sets, which can permit greater collaboration between users of a data management platform. For example, a system can be configured to enable a user to add or associate a contextual object with a data measure. In associating the contextual object with the data measure, the system can store the contextual object with the unique dimension coordinates that are associated with data measure. Other users of the system may, in turn, access the data measure and be shown the contextual object associated with the data measure.
Type: Application
Filed: March 10, 2014
Publication date: September 18, 2014
Applicant: DECISYON, INC.
Inventors: Antonio Vivalda, Fabrizio Del Franco, Simone Costantini, Alessandro Pratico, Franco Petrucci
-
Publication number: 20140279831
Abstract: Techniques for data modeling are provided. Enterprise data is organized into reference data for entities that an enterprise wants to track and monitor. Relationship data is created that establishes relationships among the various entities within the enterprise data. The reference data and the relationship data are published within an enterprise data warehouse for accessing the enterprise data.
Type: Application
Filed: September 27, 2013
Publication date: September 18, 2014
Applicant: Teradata US, Inc.
Inventors: Thomas Kevin Ryan, Anand Louis
-
Publication number: 20140279834
Abstract: A method of operating a data analysis system includes retrieving objects from a data source. Each of the retrieved objects includes (i) data and (ii) metadata describing the data. The method further includes dynamically creating a cumulative schema by, for each object of the retrieved objects: (i) inferring a schema from the object based on the metadata of the object and inferred data types of elements of the data of the object, (ii) creating a unified schema, wherein the unified schema describes both (a) the object described by the inferred schema and (b) a cumulative set of objects described by the cumulative schema, and (iii) storing the unified schema as the cumulative schema. The method further includes exporting the data of each of the retrieved objects to a data warehouse.
Type: Application
Filed: March 14, 2014
Publication date: September 18, 2014
Applicant: AMIATO, INC.
Inventors: Dimitris TSIROGIANNIS, Nathan A. BINKERT, Stavros HARIZOPOULOS, Mehul A. SHAH, Benjamin SOWELL, Bryan D. KAPLAN, Kevin R. MEYER
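The infer, unify, and store loop in this abstract can be sketched in a simplified form. This sketch is an assumption-laden illustration: it infers types from the data alone (the application also uses object metadata), it flattens nested objects into dotted paths, and the function names are hypothetical.

```python
from typing import Dict, Iterable, Set

Schema = Dict[str, Set[str]]  # dotted field path -> set of observed type names


def infer_schema(obj: dict, prefix: str = "") -> Schema:
    """Infer a schema for one object from the Python types of its values."""
    schema: Schema = {}
    for key, value in obj.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            schema.update(infer_schema(value, path + "."))
        else:
            schema[path] = {type(value).__name__}
    return schema


def unify(cumulative: Schema, inferred: Schema) -> Schema:
    """Return a schema describing both the cumulative set and the new object."""
    unified = {path: set(types) for path, types in cumulative.items()}
    for path, types in inferred.items():
        unified.setdefault(path, set()).update(types)
    return unified


def build_cumulative_schema(objects: Iterable[dict]) -> Schema:
    """Fold every object into the cumulative schema, one at a time."""
    cumulative: Schema = {}
    for obj in objects:
        cumulative = unify(cumulative, infer_schema(obj))
    return cumulative
```

A field that appears with different types across objects ends up with more than one type name in its set, which is the point at which an export step would need to choose a column representation.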
-
Publication number: 20140279828
Abstract: A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated with the input data.
Type: Application
Filed: March 13, 2013
Publication date: September 18, 2014
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
-
Publication number: 20140279833
Abstract: Various embodiments are directed to techniques for providing one or more reduced-size rule cubes indicating cell rules. A computer-program product embodied in a machine-readable storage medium includes instructions to cause a computing device to select a cell rule to include in a rule cube based on applicability of the cell rule to a selected portion of a data cube; analyze the cell rule to identify a wildcarded dimension in a specification of cells of the data cube that are subject to the cell rule; and generate the rule cube indicating applicability of the cell rule to the selected portion of the data cube, wherein a cell of the rule cube corresponds to multiple cells of the data cube, and wherein the wildcarded dimension of the rule cube is reduced in length in comparison to a length of the wildcarded dimension of the data cube. Other embodiments are described and claimed.
Type: Application
Filed: March 13, 2014
Publication date: September 18, 2014
Applicant: SAS INSTITUTE INC.
Inventors: Qing Gong, Amy Spelman Davis, Ryan Norris, Enrico Stevanoni
-
Publication number: 20140279827
Abstract: A method, system, and computer program product for managing time dimension tables are disclosed. In a first aspect, the method comprises providing an intelligent time dimension agent application and optimizing a time dimension table by the intelligent time dimension agent based upon changing data in associated fact tables. In a second aspect, the system comprises a processor and a memory coupled to the processor, wherein the memory includes an application which, when executed by the processor, causes the processor to carry out the steps of the method. In a third aspect, the computer program product comprises a non-transitory computer readable medium having a computer readable program recorded thereon, wherein the computer readable program, when executed on a computer device, causes the computer device to carry out the steps of the method.
Type: Application
Filed: March 12, 2013
Publication date: September 18, 2014
Applicant: International Business Machines Corporation
Inventors: Joseph P. Bigus, Leon Gong
-
Publication number: 20140279829
Abstract: A method includes receiving data from a non-geo enabled data source, obtaining information related to location in the received data, converting the obtained information to a standardized form of geo-location data, and storing the geo-location data.
Type: Application
Filed: March 13, 2013
Publication date: September 18, 2014
Applicant: SAP AG
Inventor: Manfred Reinart
-
Publication number: 20140279826
Abstract: A data feeds platform, comprising a data feed using a data feed protocol. The data feed protocol comprises, a feed header defining at least one identification attribute, a data header defining a data structure comprising data nodes, each data node corresponding to a subset of the data, and a data section, the data section storing the data according to the data structure. A plurality of heterogeneous data protocols is stored. A data input is received in one of the plurality of heterogeneous data protocols. The data input is converted to the data feed protocol, creating the data feed.
Type: Application
Filed: March 12, 2013
Publication date: September 18, 2014
Applicant: MORGAN STANLEY
Inventors: Bhavin P. Dave, Amit S. Modi
-
Publication number: 20140279830
Abstract: Approaches for data integration between multiple IT sources using automated data processing based on target metadata are provided. Specifically, an integration component is configured to load a mapped data set into a table with delta processing based on a configuration object containing, e.g., the source data location, target table name, and source to target mapping. The integration component uses the target metadata to validate data, identify changes, generate the necessary database programming language (e.g., structured query language (SQL)), and run the database programming language with data binding to perform the actual data updates. The integration component leverages the data target metadata to automate the processing of source data, thus providing a way to validate the data, and identify delta changes at the field level between the source and target. This significantly reduces the overall development effort, while providing consistency in record handling and error reporting.
Type: Application
Filed: March 15, 2013
Publication date: September 18, 2014
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Gautam Majumdar, Tarun K. Saha, Michael Q. Wang
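A minimal sketch of metadata-driven delta loading, using Python's built-in `sqlite3` module, is given below. It is not the approach claimed in the application: the configuration keys (`target_table`, `key`, `mapping`), the `load_with_delta` function, and the use of SQLite's `PRAGMA table_info` as the source of target metadata are assumptions chosen for illustration. Table and column names are interpolated into the SQL text, so the configuration is assumed to be trusted; row values are always passed as bound parameters.

```python
import sqlite3
from typing import Dict, Iterable


def load_with_delta(conn: sqlite3.Connection, config: dict,
                    source_rows: Iterable[dict]) -> Dict[str, int]:
    """Insert new rows and update only the fields that changed."""
    table = config["target_table"]
    key = config["key"]          # target key column
    mapping = config["mapping"]  # source field -> target column

    # Target metadata: column names of the target table (row[1] is the name).
    columns = {row[1] for row in conn.execute(f"PRAGMA table_info({table})")}
    unknown = set(mapping.values()) - columns
    if unknown:
        raise ValueError(f"mapping targets unknown columns: {sorted(unknown)}")

    stats = {"inserted": 0, "updated": 0, "unchanged": 0}
    for src in source_rows:
        mapped = {tgt: src[field] for field, tgt in mapping.items()}
        cols = ", ".join(mapped)
        existing = conn.execute(
            f"SELECT {cols} FROM {table} WHERE {key} = ?", (mapped[key],)
        ).fetchone()
        if existing is None:
            marks = ", ".join("?" for _ in mapped)
            conn.execute(f"INSERT INTO {table} ({cols}) VALUES ({marks})",
                         tuple(mapped.values()))
            stats["inserted"] += 1
            continue
        # Field-level delta: compare each mapped column with the stored value.
        changed = [c for c, old in zip(mapped, existing) if old != mapped[c]]
        if not changed:
            stats["unchanged"] += 1
            continue
        assignments = ", ".join(f"{c} = ?" for c in changed)
        conn.execute(f"UPDATE {table} SET {assignments} WHERE {key} = ?",
                     [mapped[c] for c in changed] + [mapped[key]])
        stats["updated"] += 1
    return stats
```

Generating the `UPDATE` from the list of changed columns keeps each statement limited to the fields that actually differ, which mirrors the field-level delta identification the abstract mentions.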
-
Publication number: 20140279835
Abstract: Techniques are disclosed to determine data quality issues in data processing jobs. A data processing job is received that specifies one or more processing steps designed based on one or more data schemas and that further specifies one or more desired quality metrics to measure at the one or more processing steps. One or more state machines are provided that are generated based on the quality metrics and on the data schemas. Input data to the data processing job are processed using the one or more state machines, in order to generate output data and a set of data quality records characterizing a set of data quality issues identified during the execution of the data processing job.
Type: Application
Filed: March 25, 2014
Publication date: September 18, 2014
Applicant: International Business Machines Corporation
Inventors: Jeff J. LI, Yong LI
-
Patent number: 8839124
Abstract: An apparatus for configuring a diagnostics module for an industrial plant includes an application module, which is configured to run on a computer and includes a graphics builder. The graphics builder includes a workspace configured to present a display area on a monitor of the computer, the display area configured to be shown by the diagnostics module on a client machine, and graphics tools configured to be manipulated via at least one user input device of the computer for customization of the display area.
Type: Grant
Filed: August 3, 2012
Date of Patent: September 16, 2014
Assignee: Yokogawa Electric Corporation
Inventor: Ryan Justiza
-
Patent number: 8838525
Abstract: In one embodiment, a method for communicating data is provided. The method includes defining a data transmission unit (DTU) to be sent in an xDSL data stream, defining a retransmit container as a time slot that corresponds to the DTU, maintaining, in a retransmit container, a copy of the DTU and an index corresponding to the retransmit container, the retransmit container being maintained at the second device independently of the first device, transmitting the DTU in the xDSL data stream, determining whether the DTU should be retransmitted, and if the DTU should be retransmitted: identifying the DTU by the retransmit container and retransmitting a copy of the DTU as identified by the retransmit container when the DTU reaches a predetermined position in the retransmission buffer.
Type: Grant
Filed: June 21, 2012
Date of Patent: September 16, 2014
Assignee: Broadcom Corporation
Inventors: Benoit Christiaens, Miguel Peeters, Raphael Cassiers
-
Patent number: 8838626
Abstract: Embodiments of techniques and systems for parallel XML parsing are described. An event-level XML parser may include a lightweight events partitioning stage, parallel events parsing stages, and a post-processing stage. The events partition may pick out event boundaries using single-instruction, multiple-data instructions to find occurrences of the “<” character, marking event boundaries. Subsequent checking may be performed to help identify other event boundaries, as well as non-boundary instances of the “<” character. During events parsing, unresolved items, such as namespace resolution or matching of start and end elements, may be recorded in structure metadata. This structure metadata may be used during the subsequent post-processing to perform a check of the XML data. If the XML data is well-formed, individual sub-event streams formed by the events parsing processes may be assembled into a flat result event stream structure. Other embodiments may be described and claimed.
Type: Grant
Filed: December 17, 2009
Date of Patent: September 16, 2014
Assignee: Intel Corporation
Inventors: Zhiqiang Yu, Yuejian Fang, Lei Zhai, Yun Wang, Zhonghai Wu, Mo Dai
-
Publication number: 20140258209
Abstract: Systems and methods for interest-driven business intelligence systems in accordance with embodiments of the invention are illustrated. In one embodiment, an interest-driven distributed data server system includes a processor, an interest-driven distributed data server application, aggregate data storage, and a data source directory, wherein the interest-driven distributed data server application configures the processor to obtain reporting data requirements, determine at least one data dimension using the reporting data requirements, identify at least one data source that can provide data associated with the data dimension, generate data source job data, transmit the data source job data to at least one data source, and receive job response data, store the received job response data in the aggregate data storage, build reporting data based on the reporting data requirements, and transmit the reporting data.
Type: Application
Filed: March 8, 2013
Publication date: September 11, 2014
Applicant: Platfora, Inc.
Inventors: John Glenn Eshleman, Kevin Scott Beyer, Brian Babcock, Carl-Christian Kanne
-
Publication number: 20140258210
Abstract: The present subject matter relates to generation of business intelligence reports with navigable reference indicators. The method comprises obtaining information from an information source for providing in a business intelligence report. Further, report parameters are obtained through a user interface, wherein the report parameters include at least notes and indicator parameters. The notes and indicator parameters indicate a plurality of navigable reference indicators to be included in the business intelligence report. The method further comprises mapping the notes and indicator parameters with the obtained information. Based on the mapping, the business intelligence report is generated with the plurality of navigable reference indicators. Further, the plurality of navigable reference indicators provides a forward and backward link between two or more sections of the business intelligence report.
Type: Application
Filed: February 18, 2014
Publication date: September 11, 2014
Applicant: Tata Consultancy Services Limited
Inventor: Malukanparampil Meer Mohammed Riyaz
-
Patent number: 8832020
Abstract: A computer system architecture and process transforms the organizing principles of business data management seen in traditional or commonplace settings across disparate data stores into an object-semantic model. The transformation is implemented in a transitory, managed risk implementation approach for data access migration. This approach will enable data agility and will move data storage closer to information storage, while supporting the rapid development of business capabilities. This approach will bridge the gaps between relational database/file based storage (less expressive) and object-semantic representation (more expressive). This process/method will connect information residing in legacy data structures such as relational database/file based storage on disk drives to instances populating an expressive object-semantic model in persistent grid based memory.
Type: Grant
Filed: November 15, 2011
Date of Patent: September 9, 2014
Assignee: Medco Health Solutions, Inc.
Inventors: Mark Joseph Landy, Jimmy Chou, Tej Anand, Tanya Kaplun, Sarah Mord Myers, Peter Foreman
-
Patent number: 8832016
Abstract: An approach is provided for providing recommendation services while protecting the privacy of personal user information. The approach involves determining a request for at least one recommendation, the request specifying at least in part an anonymized user preference record. The approach also involves causing, at least in part, a comparison of the anonymized user preference record against one or more previously stored anonymized user preference records. The approach further involves causing, at least in part, a selection of the anonymized user preference record, the one or more previously stored anonymized user preference records, or a combination thereof based, at least in part, on the comparison. The approach additionally involves determining to generate the at least one recommendation based, at least in part, on the selection.
Type: Grant
Filed: December 9, 2011
Date of Patent: September 9, 2014
Assignee: Nokia Corporation
Inventor: Jonathan Ledlie
-
Patent number: 8832158
Abstract: An approach is provided in which a processor receives a scan request to scan data included in a data table. The processor selects a column in the data table corresponding to the scan request and retrieves column data entries from the selected column. In addition, the processor identifies the width of the selected column and selects a scan algorithm based upon the identified column width. In turn, the processor loads the column data entries into column data vectors and computes scan results from the column data vectors using the selected scan algorithm.
Type: Grant
Filed: March 29, 2012
Date of Patent: September 9, 2014
Assignee: International Business Machines Corporation
Inventors: Eduard Diner, Jens Leenstra, Vijayshankar Raman, Adam J. Storm
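The width-based selection described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the 8-bit threshold, the vector size, the equality predicate, and every function name below are assumptions chosen for illustration. Narrow columns use a precomputed lookup table, wide columns fall back to direct comparison.

```python
from typing import Callable, List, Sequence

VECTOR_SIZE = 4        # hypothetical number of entries per column data vector
NARROW_LIMIT_BITS = 8  # hypothetical cut-off between "narrow" and "wide" columns

ScanFn = Callable[[Sequence[int]], List[bool]]


def make_lookup_scan(width_bits: int, target: int) -> ScanFn:
    """Narrow columns: precompute the answer for every possible value."""
    table = [value == target for value in range(1 << width_bits)]
    return lambda vector: [table[value] for value in vector]


def make_compare_scan(width_bits: int, target: int) -> ScanFn:
    """Wide columns: a table would be too large, so compare directly."""
    return lambda vector: [value == target for value in vector]


def select_scan_algorithm(width_bits: int) -> Callable[[int, int], ScanFn]:
    """Pick the scan algorithm from the identified column width."""
    if width_bits <= NARROW_LIMIT_BITS:
        return make_lookup_scan
    return make_compare_scan


def scan_column(column: Sequence[int], width_bits: int, target: int) -> List[bool]:
    """Load entries into fixed-size vectors and scan each vector."""
    scan = select_scan_algorithm(width_bits)(width_bits, target)
    vectors = [column[i:i + VECTOR_SIZE] for i in range(0, len(column), VECTOR_SIZE)]
    results: List[bool] = []
    for vector in vectors:
        results.extend(scan(vector))
    return results
```

For example, `scan_column([1, 5, 3, 5, 5], 4, 5)` takes the lookup-table path because the column is 4 bits wide, while a 32-bit column takes the comparison path.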
-
Patent number: 8832601
Abstract: A method and apparatus for an ETL tool utilizing dimension trees. The method may include identifying various dimensions associated with one or more databases, and creating dimension trees based on these dimensions. The method may further include creating an ETL pyramid having multiple levels, linking the dimension trees to corresponding levels of the ETL pyramid, and creating a table for each level of the ETL pyramid to store a subset of the dimension trees that is linked to a corresponding level of the ETL pyramid.
Type: Grant
Filed: May 31, 2008
Date of Patent: September 9, 2014
Assignee: Red Hat, Inc.
Inventor: Eric J. Williamson
-
Publication number: 20140250054
Abstract: A data management system for calculating and retrieving analytic data involves a computing device coupled to a database. The system includes a measure data structure within the database including a measure-data element, a collection data structure within the database including a collection-data element, and a fact data structure within the database, and configured to store a fact-data element which includes an aggregated value, a link to the measure-data element, and a link to the collection-data element. The system also includes a fact-analyzer module configured to generate the aggregated value from the transaction data utilizing at least the measure-data element and the collection-data element, wherein the measure-data element defines how the aggregated value is calculated and the collection-data element defines a categorization of the aggregated value.
Type: Application
Filed: March 4, 2013
Publication date: September 4, 2014
Applicant: MASTERCARD INTERNATIONAL INCORPORATED
Inventor: William C. Jones
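The relationship between measure, collection, and fact elements can be sketched in Python as follows. This is an illustrative reading of the abstract only: the class names, the `amount` field, and the in-memory representation are assumptions, whereas the application describes structures held in a database.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Measure:
    """Defines how the aggregated value is calculated."""
    name: str
    aggregate: Callable[[List[float]], float]


@dataclass(frozen=True)
class Collection:
    """Defines the categorization the aggregated value belongs to."""
    name: str
    contains: Callable[[Dict], bool]


@dataclass(frozen=True)
class Fact:
    """Stores the aggregated value plus links to its measure and collection."""
    value: float
    measure: Measure
    collection: Collection


def analyze(transactions: List[Dict], measure: Measure, collection: Collection) -> Fact:
    """Hypothetical fact analyzer: filter by collection, aggregate by measure."""
    amounts = [t["amount"] for t in transactions if collection.contains(t)]
    return Fact(measure.aggregate(amounts), measure, collection)
```

A "total" measure built on `sum`, combined with a collection that selects one region, yields a fact holding that region's total together with references back to both definitions.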
-
Patent number: 8825592
Abstract: A computer-implemented method of extracting data from a document in an electronic format. The method includes the steps of accessing a file in an electronic format from a memory module; extracting data from the file corresponding to a plurality of keys contained within a mapping structure stored in the memory module; organizing the extracted data into values, wherein each value maps to one of the plurality of keys to form a hash map; storing the hash map in a database; and providing a user access to the database via an output device. The output device allows the user to view a customizable document whose content is derived from the values and keys stored in the database.
Type: Grant
Filed: March 10, 2009
Date of Patent: September 2, 2014
Assignee: Web Access, Inc.
Inventors: Stephen A. Lobo, Claudian Fernandes
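A rough Python sketch of the key-driven extraction step is given below. The regular-expression mapping structure, the sample keys, and the SQLite table layout are all assumptions made for illustration; the abstract does not say how keys are matched or how the hash map is persisted.

```python
import re
import sqlite3
from typing import Dict

# Hypothetical mapping structure: each key names a pattern with one capture group.
KEY_PATTERNS: Dict[str, str] = {
    "invoice_number": r"Invoice\s*#:\s*(\S+)",
    "total": r"Total:\s*([\d.]+)",
}


def extract_to_hash_map(text: str, key_patterns: Dict[str, str]) -> Dict[str, str]:
    """Extract one value per key; keys with no match are left out."""
    hash_map: Dict[str, str] = {}
    for key, pattern in key_patterns.items():
        match = re.search(pattern, text)
        if match:
            hash_map[key] = match.group(1)
    return hash_map


def store_hash_map(conn: sqlite3.Connection, hash_map: Dict[str, str]) -> None:
    """Persist the key/value pairs so a viewer can query them later."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS extracted "
        "(field_key TEXT PRIMARY KEY, field_value TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO extracted (field_key, field_value) VALUES (?, ?)",
        list(hash_map.items()),
    )
    conn.commit()
```

Running `extract_to_hash_map` on a document body and passing the result to `store_hash_map` with an open `sqlite3` connection leaves one row per matched key.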
-
Patent number: 8825628
Abstract: The disclosure presents a method, system and computer-readable medium related to automatically analyzing structure for a web page. The method embodiment comprises building a training corpus comprising a broad stylistic coverage of web pages, segmenting a web page into information blocks, identifying semantic categories of the information blocks using the training corpus and applying the identical semantic categories in a web-based tool.
Type: Grant
Filed: August 18, 2010
Date of Patent: September 2, 2014
Assignee: AT&T Intellectual Property II, L.P.
Inventors: Junlan Feng, Barbara B. Hollister
-
Patent number: 8826125
Abstract: A news reader is disclosed which selects the optimal articles for presentation to the user and which facilitates a sense of community among users. Upon receiving an article from a news provider, the news reader analyzes characteristics associated with the article. Amongst other things, the characteristics may indicate whether the article represents a breaking news story or whether the article includes advertisements. After analyzing the characteristics of the article, the news reader determines whether the article is to be designated a primary/presentable article and aggregates the article with other primary/presentable articles related to the same topic.
Type: Grant
Filed: March 12, 2012
Date of Patent: September 2, 2014
Assignee: Hyperion Media LLC
Inventor: Nelson Chu
-
Patent number: 8826277
Abstract: High level characteristics are extracted from a virtual machine provisioning request stream for a cloud computing environment. A plurality of prediction models are generated for a plurality of different types of virtual machines. A regression-correlation ensemble is applied to appropriate ones of the prediction models to predict future virtual machine demand for the cloud computing environment.
Type: Grant
Filed: November 29, 2011
Date of Patent: September 2, 2014
Assignee: International Business Machines Corporation
Inventors: Rong N. Chang, Mihwa Choi, Yexi Jiang, Meir Laker, Chang-Shing Perng, Hidayatullah H. Shaikh, Edward C. So, Tao Tao
-
Patent number: 8825593
Abstract: A system for aggregating data from a plurality of operational databases, and a method for providing the same, are provided. The system includes a data store storing a collection of pre-defined extraction scripts. The extraction scripts identify data available for extraction from a plurality of operational database products, and including parameters for specifying which of the data to extract. A data warehouse server executes the extraction scripts to extract, merge, transform and load the specified data from the plurality of operational databases into a multidimensional database.
Type: Grant
Filed: March 26, 2010
Date of Patent: September 2, 2014
Assignee: Trapeze Software ULC
Inventors: Matthew Dodds, Chacko Kattithara Paul
-
Patent number: 8825591
Abstract: System and method for tracking statistics at the subfile level and transparently placing or migrating inactive or less active blocks of data to other storage devices. Embodiments may provide mechanisms to track statistics at the subfile level of files including, but not limited to, database files, and to transparently place or migrate inactive or less active blocks of data of the files from higher-performing, typically more expensive, storage to lower-performing, typically less expensive, storage, while placing or migrating active blocks of data of the files to higher-performing storage, based on the subfile-level statistics rather than on file-level timestamps. In some embodiments, knowledge of file structure (e.g., database file structure), for example knowledge of database partitions with header blocks and data blocks in databases using data partitioning, may be used to separate more active and less active blocks of data of files onto storage with different performance and/or other characteristics.
Type: Grant
Filed: December 31, 2003
Date of Patent: September 2, 2014
Assignee: Symantec Operating Corporation
Inventors: Xinyi David Lai, Par Botes, HanCheng Hsiung
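The idea of placing blocks by subfile-level statistics rather than file-level timestamps can be sketched in Python as below. The access-count statistic, the fixed threshold, the two tier names, and the `BlockTierer` class are assumptions for illustration; the abstract does not specify which statistics are tracked or how placement decisions are made.

```python
from collections import Counter
from typing import Dict, Tuple


class BlockTierer:
    """Tracks per-block access counts and suggests a storage tier per block."""

    def __init__(self, hot_threshold: int) -> None:
        self.hot_threshold = hot_threshold
        # Statistics are keyed by (file name, block index), not by file alone.
        self.access_counts: Counter = Counter()

    def record_access(self, file_name: str, block_index: int) -> None:
        self.access_counts[(file_name, block_index)] += 1

    def placement(self, file_name: str, block_count: int) -> Dict[int, str]:
        """Map each block of one file to 'fast' or 'slow' storage."""
        plan: Dict[int, str] = {}
        for index in range(block_count):
            key: Tuple[str, int] = (file_name, index)
            hot = self.access_counts[key] >= self.hot_threshold
            plan[index] = "fast" if hot else "slow"
        return plan
```

With this scheme a single database file can end up split across tiers: a frequently read header block stays on fast storage while rarely touched data blocks of the same file are candidates for migration.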
-
Publication number: 20140244570
Abstract: Disclosed herein are techniques for optimizing and managing the execution of hybrid flows. An execution plan is generated for each hybrid flow based at least partially on attributes associated therewith. The execution of each hybrid flow is managed in accordance with the execution plan.
Type: Application
Filed: February 28, 2013
Publication date: August 28, 2014
Applicant: Hewlett-Packard Development Company, L.P.
Inventors: Alkiviadis Simitsis, William K. Wilkinson, Umeshwar Dayal
-
Publication number: 20140236884
Abstract: Embodiments of systems, methods, and computer-readable mediums for analysis of datasets without predefined dimensions are generally described herein. In some embodiments, analysis of datasets without predefined dimensions may include receiving a selection of multiple dimensions of information from a database to be used for data analysis; receiving a selection for a type of report; and dynamically generating a query statement based on the selection of the multiple dimensions. Further embodiments may include the dynamically generated query statement including multiple iterative join clauses as a function of the selected dimensions; executing the query statement against an in-memory database; and displaying columns of information obtained from execution of the query statement.
Type: Application
Filed: February 15, 2013
Publication date: August 21, 2014
Applicant: SAP AG
Inventors: Mitchell Clark, Celso da Silveira, Julian Ogando
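Generating a query with one join clause per selected dimension can be sketched in Python as follows. The star schema (`sales`, `region`, `product`), the allow-list, and the use of an in-memory SQLite database as a stand-in for the in-memory database mentioned in the abstract are all assumptions for illustration.

```python
import sqlite3
from typing import List

# Hypothetical star schema: each dimension table has (id, name) and the
# fact table `sales` references it through a `<dimension>_id` column.
ALLOWED_DIMENSIONS = {"region", "product"}


def build_query(dimensions: List[str]) -> str:
    """Build one JOIN clause per selected dimension, then aggregate."""
    if not dimensions:
        raise ValueError("select at least one dimension")
    unknown = [d for d in dimensions if d not in ALLOWED_DIMENSIONS]
    if unknown:
        # Identifiers are interpolated into SQL, so only known names pass.
        raise ValueError(f"unknown dimensions: {unknown}")
    columns = ", ".join(f"{d}.name AS {d}_name" for d in dimensions)
    joins = " ".join(f"JOIN {d} ON sales.{d}_id = {d}.id" for d in dimensions)
    grouping = ", ".join(f"{d}.name" for d in dimensions)
    return (
        f"SELECT {columns}, SUM(sales.amount) AS total "
        f"FROM sales {joins} GROUP BY {grouping} ORDER BY {grouping}"
    )


def demo_connection() -> sqlite3.Connection:
    """Create a small in-memory database matching the assumed schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE region (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE product (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE sales (region_id INTEGER, product_id INTEGER, amount INTEGER);
        INSERT INTO region VALUES (1, 'EU'), (2, 'US');
        INSERT INTO product VALUES (1, 'book'), (2, 'pen');
        INSERT INTO sales VALUES (1, 1, 10), (1, 2, 5), (2, 1, 7), (1, 1, 3);
        """
    )
    return conn
```

Calling `demo_connection().execute(build_query(["region", "product"])).fetchall()` returns one row per region and product pair with its summed amount; selecting a single dimension produces a query with a single join.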
-
Patent number: 8812449
Abstract: A storage system having a plurality of storage devices including a first type storage device and a second type storage device, a reliability attribute and/or a performance attribute of the first type storage device being different from a reliability attribute and/or a performance attribute of the second type storage device. The storage system also has a control unit and managing a plurality of virtual volumes. If necessary, a storage area allocated to a first portion of a virtual volume of the plurality of virtual volumes is changed from a first type storage area of the plurality of first type storage areas to a second type storage area of the plurality of second type storage areas while another first type storage area of the plurality of first type storage areas is allocated to a second portion of the virtual volume.
Type: Grant
Filed: September 10, 2012
Date of Patent: August 19, 2014
Assignee: Hitachi, Ltd.
Inventors: Teiko Kezuka, Akira Murotani, Seiichi Higaki
-
Patent number: 8812433
Abstract: Multi-dimensional surrogation systems and methods are provided that generate at least one up-to-date data surrogate using information of data and numerous data changes received from at least one data source, including a production system. Embodiments described further perform bulk-to-brick transformation of the up-to-date data surrogate. Brick-level data, or item data, is further indexed and stored in an application-aware manner. The stored item data is easily accessible by many data management applications for integrated item search and recovery functions, audit functions, monitoring and supervision functions, legal discovery functions, compliance functions, archival functions, backup functions, disaster recovery functions, and more. Bulk-to-brick transformation and access of the stored item data occur off of the production system, thus contributing no performance degradation to the production system.
Type: Grant
Filed: October 2, 2006
Date of Patent: August 19, 2014
Assignee: Mimosa Systems, Inc.
Inventors: Roy P. D'Souza, Thirumalai Muppur Ravi