Data Extraction, Transformation, And Loading (etl) Patents (Class 707/602)
  • Patent number: 8812434
    Abstract: A technique for monitoring software which efficiently communicates transaction trace data, including static and dynamic data, from an agent to a manager is disclosed. A tree data structure describes transactions as a sequence of transaction elements. Each branch may correspond to one type of transaction. The nodes along a branch may be associated with components of monitored software that are invoked to perform the transaction. A provisional branch may be constructed when an agent is unable to initially identify the transaction. Transaction data may be stored in association with the nodes in the provisional branch during the monitoring. After information needed to identify the transaction becomes available to the agent, the provisional branch is updated. The transaction data is reported in response to determining the branch in the tree data structure that matches the provisional branch.
    Type: Grant
    Filed: October 12, 2012
    Date of Patent: August 19, 2014
    Assignee: CA, Inc.
    Inventors: Marco Gagliardi, Martin Tali, Ramesh Mani
  • Patent number: 8812806
    Abstract: Method and system for migrating a virtual storage system from a source storage system having access to a source storage device to a destination storage system having access to a destination storage device is provided. A processor executable management application estimates a likelihood of success for a migration operation before the migration operation enters a cut-over duration during which client access to the source storage system and the destination storage system is restricted. The migration operation enters the cut-over duration if there is high likelihood of success for completing the migration during the cut-over duration or aborted, if there is a low likelihood of success for completing the migration during the cut-over duration.
    Type: Grant
    Filed: October 29, 2010
    Date of Patent: August 19, 2014
    Assignee: Netapp, Inc.
    Inventors: John Fredricksen, Chung-Yen Chang, Nagender Somavarapu
  • Publication number: 20140229424
    Abstract: Systems and methods for mapping and propagating a source dataset to a plurality of target tables in a target database are described herein. Embodiments include a physical database design with a template-based loader, and a method to propagate changes in the source dataset to a target database. Other embodiments include a database physical design with a plurality of small fact and summary tables. Still other embodiments include a method of extracting and loading data automatically into a target database, while simultaneously maintaining current summary tables.
    Type: Application
    Filed: January 31, 2014
    Publication date: August 14, 2014
    Applicant: Telduráõgevin Sp/f
    Inventor: Eyjólfur Gislason
  • Publication number: 20140229425
    Abstract: A system and method for a collection of digital items that includes generating a collection of result items, wherein each result item corresponds to result data of at least one query input, and wherein at least one result item includes a context property; storing the collection of result items as a first instance; and augmenting the collection in a second instance through at least the context property.
    Type: Application
    Filed: February 10, 2014
    Publication date: August 14, 2014
    Applicant: Verbify, Inc.
    Inventor: Bobby Lo
  • Publication number: 20140229423
    Abstract: An extract-transform-load (ETL) platform fetches consistent datasets in a batch for a given period of time and provides the ability to rollback that batch. The batch may be fetched for an interval of time, and the ETL platform may fetch new or changed data from different cloud/on-premise applications. It will store this data in the cloud or on-premise to build data history. As the ETL platform fetches new data, the system will not overwrite existing data, but rather will create new versions so that change history is preserved. For any reason, if businesses would like to rollback data, they could rollback to any previous batch.
    Type: Application
    Filed: February 11, 2013
    Publication date: August 14, 2014
    Inventors: Ketan Bengali, Kaniska Mandal, Alex J. Chen
  • Patent number: 8805864
    Abstract: Access to a networked application can be provided to multiple users while allowing user-specific extension fields to be created and maintained for exclusive access by the user creating the extension field. A user-customized data object that includes a standard field value of a standard field of a standard data object defined by the networked application and a user-specific extension field value of a user-specific extension field that modifies operation of the networked application for the user and that is not available to other users of the plurality of users can be received from a user for writing to memory. The user-specific extension field value can be separated from the standard field value. The standard field value and the user-specific extension field value can be persisted in a first database table and a second database table, respectively. Related systems, methods, and articles of manufacture are also provided.
    Type: Grant
    Filed: May 29, 2012
    Date of Patent: August 12, 2014
    Assignee: SAP AG
    Inventor: Daniel Figus
  • Patent number: 8805798
    Abstract: A computer-implemented method comprising partitioning data representing an input instance of a database including multiple tuples into multiple fragments of tuples, detecting tuples which violate a data quality specification in respective ones of the fragments, selecting a data cleaning asset on the basis of characteristics of errors in detected tuples for a fragment and based on declared asset capabilities, assigning a selected data cleaning asset to the fragment, the selected data cleaning asset to provide a set of candidate corrections for the detected tuples in the fragment, providing data representing an output instance of the database in which detected tuples are replaced with selected candidate corrections.
    Type: Grant
    Filed: May 10, 2012
    Date of Patent: August 12, 2014
    Assignee: Qatar Foundation
    Inventors: Ihab Francis Ilyas Kaldas, George Beskales, Ahmed Elmagarmid
  • Patent number: 8799207
    Abstract: Some embodiments provide a program that identifies a set of data storages that includes data for a set of data consuming entities. For each data storage, the program identifies a set of data source objects that represents data in the data storage. The program provisions a unified data source that includes the identified sets of data source objects. The program provides the identified sets of data source objects through the unified data source to the data consuming entities in order for the data consuming entities to access the data in the set of data storages.
    Type: Grant
    Filed: October 8, 2011
    Date of Patent: August 5, 2014
    Assignee: Bay Dynamics
    Inventors: Ryan G. Stolte, Firas S. Rifai
  • Patent number: 8799288
    Abstract: A generic and expandable document aspect system and method for searching, browsing, presenting, and interacting with data assembled from document contents and related external data is provided. New varieties of document aspects are added to existing installations and can be accessed by users without requiring upgrades to server or clients, for example by using plug-in technology.
    Type: Grant
    Filed: October 31, 2011
    Date of Patent: August 5, 2014
    Assignee: Ebrary
    Inventors: Richard Holzgrafe, Tom Santos, Christopher Warnock
  • Patent number: 8799206
    Abstract: Multi-dimensional surrogation systems and methods are provided that generate at least one up-to-date data surrogate using information of data and numerous data changes received from at least one data source, including a production system. Embodiments described further perform bulk-to-brick transformation of the up-to-date data surrogate. Brick-level data, or item data, is further indexed and stored in an application-aware manner. The stored item data is easily accessible by many data management applications for integrated item search and recovery functions, audit functions, monitoring and supervision functions, legal discovery functions, compliance functions, archival functions, backup functions, disaster recovery functions, and more. Bulk-to-brick transformation and access of the stored item data occur off of the production system, thus contributing no performance degradation to the production system.
    Type: Grant
    Filed: October 2, 2006
    Date of Patent: August 5, 2014
    Assignee: Mimosa Systems, Inc.
    Inventors: Roy P. D'Souza, Thirumalai Muppur Ravi
  • Patent number: 8798988
    Abstract: One embodiment of the present invention provides a system that uses a known set of related terms to identify additional related terms in different languages. During operation, the system receives two terms that are known to be related. The system translates these two non-synonym terms into a given language, and then adds the translations of the two related terms to a list of known related pairs.
    Type: Grant
    Filed: October 23, 2007
    Date of Patent: August 5, 2014
    Assignee: Google Inc.
    Inventors: Trystan G. Upstill, Steven D. Baker
  • Publication number: 20140214753
    Abstract: Preferred embodiments of the invention provide systems and methods for improving the speed and efficiency of a data warehouse. The invention enables the loading of data from different data sources into a common data warehouse structure. Preferred embodiments include an ETL process is modified to perform a joined indexing operation which reduces the number of lookup requests required. Further embodiments contemplate a date dimension and hierarchical data structure which improve operation speed.
    Type: Application
    Filed: December 27, 2013
    Publication date: July 31, 2014
    Inventor: Joseph Guerra
  • Patent number: 8793213
    Abstract: A central data warehouse includes embedded data marts. These embedded data marts, referred to as workspaces, are assigned centrally manage data by reference only but rely directly on the centrally managed data and the underlying infrastructure. Workspaces still allow departments in an enterprise to perform certain actions on their own (like adding new data and building new models) without having to instantiate copies of the centrally managed data in a locally managed data mart.
    Type: Grant
    Filed: May 23, 2012
    Date of Patent: July 29, 2014
    Assignee: SAP AG
    Inventors: Klaus Nagel, Rainer Schaefer, Silvia Bratz, Marcus Hoepfner, Patrick Winkler, Matthias Jensen, Dirk Janning
  • Publication number: 20140207729
    Abstract: An approach is provided in which a system creates schema terms based upon matching input data query requirements to industry terms. In turn, the system generates a query and an associative map, which includes data organized according to the schema terms. The system executes the query, which retrieves the data from the associative map and loads the data into one or more storage areas.
    Type: Application
    Filed: March 22, 2014
    Publication date: July 24, 2014
    Applicant: International Business Machines Corporation
    Inventor: Manoj Kumar
  • Publication number: 20140207728
    Abstract: Systems and methods for storing and retrieving data elements transmitted via data streams received from distributed devices connected via a network. The received data elements may be stored in block stores on the distributed devices. The stored data-elements may be allocated to data-blocks of a block-store that have assigned block-identifiers and further allocated to events of the data-blocks that have assigned token-names. Stream-schema of the received data-streams may comprise a list of token-names and an index-definition for each corresponding data-stream. Indices may be generated for the event-allocated data-elements. A query may be executed in order to retrieve data-elements of the received data-streams based on the indices.
    Type: Application
    Filed: March 3, 2014
    Publication date: July 24, 2014
    Applicant: Red Lambda, Inc.
    Inventors: Robert Bird, Adam Leko, Matthew Whitlock
  • Patent number: 8788451
    Abstract: Search systems and methods are disclosed. Content and data in support of search results are stored in a central repository. Moreover, the centrally stored content and data can be made available to one or more authorized devices. Searches can be conducted across a single device, or a community of devices. The community of devices over which a search is performed can include devices that are not publicly accessible, and that require the provision of authentication information. The community of devices over which a search is performed, and the set of authorized devices, can be determined based on various factors.
    Type: Grant
    Filed: February 18, 2011
    Date of Patent: July 22, 2014
    Assignee: Avaya Inc.
    Inventor: Terry Don Jennings
  • Publication number: 20140201129
    Abstract: A probabilistic data structure is generated for efficient query processing using a histogram for unsorted data in a column of a columnar database. A bucket range size is determined for multiples buckets of a histogram of a column in a columnar database table. In at least some embodiments, the histogram may be a height-balanced histogram. A probabilistic data structure is generated to indicate for which particular buckets in the histogram there is a data value stored in the data block. When an indication of a query directed to the column for select data is received, the probabilistic data structure for each of the data blocks storing data for the column may be examined to determine particular ones of the data blocks which do not need to be read in order to service the query for the select data.
    Type: Application
    Filed: January 15, 2013
    Publication date: July 17, 2014
    Applicant: AMAZON TECHNOLOGIES, INC.
    Inventor: AMAZON TECHNOLOGIES, INC.
  • Patent number: 8781996
    Abstract: Systems, methods and computer-readable media process a series of media files into a searchable format. The method includes generating a media database by processing each of a plurality of programs. The steps of the method include extracting a subtitle track from each of the programs, retrieving at least one frame associated with the subtitle track, adding metadata to the extracted subtitle track and at least one frame, processing the subtitle track, program statistics and at least one frame in a media database. Another aspect includes receiving a user query associated with dialog in a program, searching the media database and presenting a listing of results, receiving a user selection or program from their listing and transmitting at least one frame, a portion of associated subtitle track and prompts for ordering the program.
    Type: Grant
    Filed: July 3, 2008
    Date of Patent: July 15, 2014
    Assignee: AT&T Intellectual Property II, L.P.
    Inventor: Simon D. Byers
  • Patent number: 8782101
    Abstract: Methods and systems for transferring data across incompatible database platforms are provided. A method may include reading data from a database export file containing a representation of objects and data from a source database of a first database platform, where the file was generated by an export utility of the first database platform, has a format specific to the first database platform, and is stored outside of the source database. The data read from the file is parsed based on the format. At least one database table represented in the file is identified based on the parsing. Row-level data associated with the at least one database table is also identified based on the parsing. The row-level data is collected in a buffer and is then inserted into a destination database of a second database platform using at least one dedicated data insertion thread.
    Type: Grant
    Filed: January 20, 2012
    Date of Patent: July 15, 2014
    Assignee: Google Inc.
    Inventor: Matthew Aaron Moore
  • Patent number: 8775472
    Abstract: A system in accordance with the present invention may include one or more processors, memory from which the processor may fetch instructions according to a clock operating at a frequency, a display device, and one or more programs stored in the memory, with instructions to create a structured file for storing data related to the media presentation, access the data related to the media presentation from the presentation application, organize the data related to the media presentation according to an event, and export the organized data related to the media presentation to the structured file. These instructions may also be executed as steps of a method and may be fetched during the execution of one or more programs stored in a computer-readable storage medium.
    Type: Grant
    Filed: August 13, 2010
    Date of Patent: July 8, 2014
    Assignee: Apple Inc.
    Inventors: Michael Peirce, Eric Tilton, David Mack, Mark Flider
  • Patent number: 8776009
    Abstract: A system for task modeling interactive sequential applications for one or more mobile devices has a software tool for defining a generic meta-model of the target applications. The generic meta-model has a static model of application components, and a dynamic model of identifiers of application screens and connections; a passive Task Model database for storing the generic meta-model for future reuse, which further comprises static instances of the generic meta-model specifically generated for a variety of mobile devices; a tracker module for real-time monitoring and creating unique identifier for each application's screen visited by the user of the mobile device, wherein created identifiers are used for generating active models of the user actual use; and an active Task Model database for storing the active models, wherein data transferring and storing is minimized to screen identifier numbers.
    Type: Grant
    Filed: March 10, 2010
    Date of Patent: July 8, 2014
    Assignee: Deutsche Telekom AG
    Inventors: Aviram Dayan, Meytal Tubi, David Mimran, Bracha Shapira, Peretz Shoval, Meira Levy, Katja Henke, Gregor Glass, Lutz Schneider
  • Patent number: 8775425
    Abstract: Methods and arrangements for accommodating a query, directing the query to datasets, creating partitions and partitioning the datasets, and returning a response to the query, the response being structured in accordance with the created partitions.
    Type: Grant
    Filed: August 24, 2010
    Date of Patent: July 8, 2014
    Assignee: International Business Machines Corporation
    Inventors: Himanshu Gupta, Rajeev Gupta, Mukesh Kumar Mohania, Ullas Balan Nambiar
  • Publication number: 20140188784
    Abstract: Certain embodiments contemplate systems and methods for improving the speed and efficiency of a data warehouse. In some embodiments, an ETL process is modified to perform a joined indexing operation which reduces the number of lookup requests required. Certain embodiments contemplate a date dimension and hierarchical data structure which improve operation speed. Still other embodiments contemplate structural organizations of biographical fact tables to better improve data access.
    Type: Application
    Filed: March 15, 2013
    Publication date: July 3, 2014
    Inventor: Joseph Guerra
  • Publication number: 20140188786
    Abstract: A method and system for providing contextual filters respective of an identified context of a plurality of multimedia content elements are provided. The method comprises receiving the plurality of multimedia content elements; generating at least one signature for each of the plurality of multimedia content elements; determining a context of each of the plurality of multimedia content elements based on its respective at least one signature, wherein a context is determined as the correlation among a plurality of cluster of signatures; and providing at least one contextual filter respective of the context of each of the plurality of multimedia content elements.
    Type: Application
    Filed: March 5, 2014
    Publication date: July 3, 2014
    Applicant: CORTICA, LTD.
    Inventors: Igal Raichelgauz, Karina Odinaev, Yehoshua Y. Zeevi
  • Publication number: 20140188783
    Abstract: A log file contains operation records, each operation record is of a certain type, and each operation record is associated with a transaction. A plurality of operation records is read from the log file into a record store. Records of the plurality of operation records of each operation record type are sampled at a predefined sampling rate. Operation records in the plurality of operations records are identified that are associated with completed transactions of which the sampled operation records are associated. The identified operation records are then extracted from the record store into a data store.
    Type: Application
    Filed: January 3, 2013
    Publication date: July 3, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Manoj K. Agarwal, Curt L. Cotner, Amitava Kundu
  • Publication number: 20140188785
    Abstract: A process includes selecting a node associated with an extracted node and a node associated with the node accepted by an acceptance unit accepting a node from nodes included in a node link obtained from text relating to a specific matter, accepting a first item in which a registered content of an analysis target is registered, from items in a table relating to the specific matter, accepting the registered content of the analysis target, a second item for use in calculating degrees of association between registered contents in the first item in the table and nodes; calculating the degrees of association between each of the registered content in the first item in the table and the nodes selected, according to a registered content in the second item in the table; and extracting a node associated with the analysis target from the selected nodes, according to the calculated degrees of association.
    Type: Application
    Filed: October 23, 2013
    Publication date: July 3, 2014
    Applicant: FUJITSU LIMITED
    Inventors: Etsuhisa ASAI, Harumichi TABATA, Machiko IKOMA, Nozomi WATANABE
  • Patent number: 8766981
    Abstract: Embodiments are directed towards budgeting and forecasting of information technology and services. In at least one of the various embodiments, financial allocation models may include multiple cost objects and one or more allocation rules. In at least one of the various embodiments, allocation rules may determine how costs associated with cost objects may be allocated to other cost objects. In at least one of the various embodiments, costs corresponding to one or more selected cost objects may be traced within a financial allocation model. Also, costs associated with multiple cost line items from different cost objects may simultaneously traced for efficient cross-referencing of disparate cost line items. Further, in at least one of the various embodiments, a display of the cost traces may be rendered and displayed to provide a visualization of the traced costs.
    Type: Grant
    Filed: June 13, 2013
    Date of Patent: July 1, 2014
    Assignee: Apptio, Inc.
    Inventors: Paul Damien McLachlan, William Byron Moeller, Brian John Bero, Tavis Dean Elliott
  • Patent number: 8768880
    Abstract: A transformation method and system is provided. The method includes generating a data hub application configured to embed extract, transform, and load (ETL) processes. The data hub application is linked to source tables and target tables. Meta data associated with the source and target tables is transferred from virtual views of the data hub application to an ETL work area of the ETL processes. An ETL job is generated and linked to the data hub application. ETL processes are executed and results are determined.
    Type: Grant
    Filed: September 30, 2013
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Arundhathi Erla, Ritesh K. Gupta, Madhusmita P. Patil, Swetha Patil, Ramesh Rajagopalan, Bijo A. Thomas
  • Patent number: 8768879
    Abstract: Methods of storing data records produced from monitoring interactions between external agents and a system are described. The method defines specific interactions that occur between the external agents and the system as events of interest. A chain of interactions occurring during respective interaction sessions between a respective external agent and the system are monitored and events of interest occurring in the chain are determined. Data records from the monitored chain are produced, the respective data record including data identifying determined events of interest and data associated therewith. A profile identity, representative of the external agent, is assigned to each data record produced during an interaction session. Data records of individual events of interest are stored in a way ordered according to the type of event of interest and data records of events of interest occurring during an interaction session are stored in a way ordered according to assigned profile identity.
    Type: Grant
    Filed: January 24, 2011
    Date of Patent: July 1, 2014
    Assignee: Nice Systems Technologies UK Limited
    Inventors: Alan Paul Rolleston Phillips, John Graham-Cumming, Gareth O'Loughlin, Jason Derek McFall, Paul David Mellor, Neil Samuel Ferguson, Alfredo Ramos-Alvarez, Liam Philip Clancy, Fiann James Curry-Towneley-O'Hagan, Andrew Galloni, Steven Heron, Maciej Buczkowski, Panagiotis Belesis, Benjamin Thomas Pickering
  • Patent number: 8768878
    Abstract: One or more embodiments characterize workloads in a data warehouse system. A set of accounting data associated with a data warehouse system comprising at least one database is collected. A set of query concurrency information associated with the database is determined determining based on the set of accounting data. The set of query concurrency information identifies a plurality of queries executed on the database simultaneously and a duration of this execution. A set of processor utilization distribution information associated with the plurality of queries is determined based on the set of accounting data. A set of data access information indicating a quantity of data accessed by each query in the plurality of queries is determined. A workload profile associated with the database is generated based on the set of query concurrency information, the set of processor utilization distribution information, and the set of data access information.
    Type: Grant
    Filed: January 21, 2011
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Gary W. Crupi, Chon N. Lei, Mei Hing Jackson
  • Publication number: 20140181004
    Abstract: A system, a method, and a computer-program product for management of runtime-loads in business process applications are disclosed. Metadata associated with an application is identified. The metadata is configured to enable loading during a runtime of the application and is stored on at least one node in a metadata repository. A runtime-loading structure for loading the metadata during the runtime of the application is generated based on the identified metadata. The runtime-loading structure contains an identification of the at least one node storing the identified metadata in the metadata repository. The identified metadata is loaded based on the generated runtime-loading structure during the runtime of the application.
    Type: Application
    Filed: December 20, 2012
    Publication date: June 26, 2014
    Applicant: Sap AG
    Inventors: Bare Said, Frank Jentsch, Frank Brunswig
  • Publication number: 20140181006
    Abstract: A controller is coupled to an Extract-Transform-Load (ETL) processor, which is connected to data storage devices. The controller comprises: a hardware storage device; a storage control component for storing, on said hardware storage device, a set of criteria for preferredness of ETL stage placements; an I/O input device detecting component for recognizing a proposed placement of a stage on a GUI canvas on the display device; an analytical component for analyzing an eventual result of the proposed placement in an ETL activity represented on the GUI canvas; a comparator for comparing the eventual result of the proposed placement in the ETL activity with the set of criteria; and an indicator control component for, responsive to an outcome of an operation of the comparator, providing to the user an indicator of a degree of preferredness of said proposed placement according to the set of criteria.
    Type: Application
    Filed: October 23, 2013
    Publication date: June 26, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: LEONARD D. GREENWOOD, ARRON J. HARDEN, JULIAN J. VIZOR
  • Publication number: 20140181005
    Abstract: Runtime performance of report generation and other response to query requests can be improve using a report analyzer that can parse the query request and distribute the necessary analytical task among two or more analytical engines of varying levels of complexity.
    Type: Application
    Filed: December 20, 2012
    Publication date: June 26, 2014
    Inventors: Dirk Baumgaertel, Uwe Schwerk, Karl-Peter Nos
  • Patent number: 8762324
    Abstract: Embodiments relate to systems and methods employing personalized query expansion to suggest measures and dimensions allowing iterative building of consistent queries over a data warehouse. Embodiments may leverage one or more of: semantics defined in multi-dimensional domain models, user profiles defining preferences, and collaborative usage statistics derived from existing repositories of Business Intelligence (BI) documents (e.g. dashboards, reports). Embodiments may utilize a collaborative co-occurrence value derived from profiles of users or social network information of a user.
    Type: Grant
    Filed: March 23, 2012
    Date of Patent: June 24, 2014
    Assignee: SAP AG
    Inventors: Raphael Thollot, Nicolas Kuchmann-Beauger, Corentin FollenFant
  • Patent number: 8762323
    Abstract: A swap-based multitenant database load balancing (SWAT) invention finds an optimal swap set that eliminates hotspots, and finds a sequencing and parallelization of swap executions for the multitenant database.
    Type: Grant
    Filed: October 10, 2011
    Date of Patent: June 24, 2014
    Assignee: NEC Laboratories America, Inc.
    Inventors: Hyun Jin Moon, Wang-Pin Hsiung, Vahit Hakan Hacigumus, Yun Chi
  • Patent number: 8763096
    Abstract: A computer-implemented method for managing authentication may include identifying authentication-capabilities information of an online service. The computer-implemented method may also include identifying, within the authentication-capabilities information, a specification indicating how a remote computing agent may interact with the online service to perform an authentication function supported by the online service. The computer-implemented method may further include using the specification to interact with the online service to perform the authentication function. Various other methods, systems, and computer-readable media are also disclosed.
    Type: Grant
    Filed: March 26, 2009
    Date of Patent: June 24, 2014
    Assignee: Symantec Corporation
    Inventors: Brian Hernacki, Shaun Cooley
  • Publication number: 20140172777
    Abstract: The present disclosure generally relates to accessing data, and more particularly, to systems and methods for improving the efficiency and quality of real-time extracting, transforming, and/or loading data using customer information control system (CICS) interval control element (ICE) chain processing.
    Type: Application
    Filed: February 25, 2014
    Publication date: June 19, 2014
    Applicant: American Express Travel Related Services Company, Inc.
    Inventor: Krishna K. Lingamneni
  • Patent number: 8751438
    Abstract: A device receives instructions for extracting, transforming, and loading data; identifies a plurality of data processing activities based on the instructions; and determines a processing sequence for performing the plurality of data processing activities. The device further creates a plurality of processing objects corresponding to the plurality of data processing activities. Each one of the plurality of processing objects including a discrete data structure that is produced by performing a corresponding one of the plurality of data processing activities. The device performs the plurality of data processing activities in accordance with the processing sequence and by using the plurality of processing objects.
    Type: Grant
    Filed: April 13, 2012
    Date of Patent: June 10, 2014
    Assignee: Verizon Patent and Licensing Inc.
    Inventors: Manish G. Kharod, Nityanand Sharma, Sutap Chatterjee, Kopinath Ratnam, Hrishikesh Premkumar, Ganesh R. Bhathivi, Bhaskar R. Gudlavenkatasiva, Josy John, Naseemunnisa Hussain, Salman Fariz
  • Patent number: 8751439
    Abstract: An apparatus and a method for preserving privacy in natural language databases are provided. Natural language input may be received. At least one of sanitizing or anonymizing the natural language input may be performed to form a clean output. The clean output may be stored.
    Type: Grant
    Filed: June 25, 2013
    Date of Patent: June 10, 2014
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Dilek Z. Hakkani-Tur, Yucel Saygin, Min Tang, Gokhan Tur
  • Patent number: 8751475
    Abstract: Metadata related to a media object may be used to obtain additional information about a mark created by a user in the media object. A media object being played by a user may be marked by creating a mark object, the mark object being used to determine the metadata to be used to obtain additional information related to the mark.
    Type: Grant
    Filed: February 14, 2007
    Date of Patent: June 10, 2014
    Assignee: Microsoft Corporation
    Inventors: Megan Lesley Tedesco, Yasser Asmi
  • Publication number: 20140156590
    Abstract: A terms listing production system generates a terms listing associated with an electronic document. Primary terms and primary definitions may be extracted from a glossary associated with the electronic document. Secondary terms may be identified based on user searches in relation to the electronic document. Secondary definitions associated with primary terms or secondary terms may be retrieved from external content and stored in connection with the associated terms. A markup language document configured for presentation to users is generated based on the stored terms and definitions.
    Type: Application
    Filed: March 4, 2013
    Publication date: June 5, 2014
    Inventors: Vincent Le Chevalier, Charles F. Geiger
  • Publication number: 20140156591
    Abstract: A computer implemented method integrates data from remote disparate data sources by processing a non-transitory media. The non-transitory media stores instructions for detecting data sets in different formats hosted in a plurality of heterogeneous databases that are accessible through a distributed network. The method extracts schema data from the plurality of heterogeneous databases and identifies related fields in two or more of the heterogeneous databases. The method links the related fields in the two or more of the plurality of heterogeneous databases and makes the data accessible through a virtual warehouse. As schemas change, as new data sources and analysis artifacts are created, the computer implemented method and system can act as a meta-data store, a provenance tracking device, and/or a knowledge management service.
    Type: Application
    Filed: November 25, 2013
    Publication date: June 5, 2014
    Inventors: Sreenivas R. Sukumar, Regina K. Ferrell, Mallikarjun Shankar
  • Publication number: 20140156589
    Abstract: Disclosed herein are techniques for arranging a series of operations. It is determined whether an operation executes more efficiently when divided. It is further determined whether a plurality of operations execute more efficiently when combined.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 5, 2014
    Applicant: Hewlett-Packard Development Company, L.P.
    Inventors: ALKIVIADIS SIMITSIS, William K. Wilkinson
  • Patent number: 8745038
    Abstract: Computer-based methods, computer-readable storage media and computer systems are provided for optimizing integration flow plans. An initial integration flow plan, one or more objectives and/or an objective function related to the one or more objectives may be received as input. A computing cost of the initial integration flow plan may be compared with the objective function. Using one or more heuristics, a set of close-to-optimal integration flow plans may be identified from all possible integration flow plans that are functionally equivalent to the initial integration flow plan. A close-to-optimal integration flow plan with a lowest computing cost may be selected from the set as a replacement for the initial integration flow plan.
    Type: Grant
    Filed: February 25, 2010
    Date of Patent: June 3, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Alkiviadis Simitsis, William K. Wilkinson, Umeshwar Dayal, Maria G. Castellanos
  • Patent number: 8744994
    Abstract: A method and system are disclosed for use with an ETL (Extract, Transform, Load) process, comprising optimizing a filter expression to select a subset of data and evaluating the filter expression on the data after the extracting, before the loading, but not during the transforming of the ETL process. The method and system optimizes the filtering using a pipelined evaluation for single predicate filtering and an adaptive optimization for multiple predicate filtering. The adaptive optimization includes an initial phase and a dynamic phase.
    Type: Grant
    Filed: December 23, 2008
    Date of Patent: June 3, 2014
    Assignee: International Business Machines Corporation
    Inventors: Ying Chen, Bin He, Rui Wang
  • Patent number: 8744839
    Abstract: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of text segments to form a text data combination set; determining an intersection of the candidate word set and the text data combination set, the intersection comprising a plurality of text data combinations; determining a plurality of designated characteristic values for the plurality of text data combinations; based at least in part on the plurality of designated characteristic values and according to at least a criterion, recognizing among the plurality of text data combinations target words whose characteristic values fulfill the criterion.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: June 3, 2014
    Assignee: Alibaba Group Holding Limited
    Inventors: Haibo Sun, Yang Yang, Yining Chen
  • Publication number: 20140149338
    Abstract: Various systems and methods for financial analysis are provided. A system is provided comprising a first node comprising a public facing data store in communication with a private facing data store, wherein the first node further comprises a cooperative lookup module configured to locate a second node, a secure data connection between the first node and the second node, and wherein the first node is configured to request processed internal data from the second node.
    Type: Application
    Filed: January 30, 2014
    Publication date: May 29, 2014
    Applicant: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.
    Inventors: DAVID S. BONALLE, MICHAEL S. LEMBERGER, PATRICK R. LOWERY, RAJENDRA R. RANE
  • Patent number: 8738565
    Abstract: A computer implemented method, apparatus, and computer usable program code for collecting data. The data is collected from a particular data source by an application portion of agent to form collected data. The application portion is associated with the particular data source in response to an event. The collected data stored in a temporary data store for transfer to a data storage system to form stored data. The stored data sent to the data storage system using a generic portion of the agent, wherein the generic portion of the agent is configured to send data to the data storage system.
    Type: Grant
    Filed: November 7, 2005
    Date of Patent: May 27, 2014
    Assignee: International Business Machines Corporation
    Inventors: Jonathan Michael Cook, Randy Allan Rendahl, David G. Robinson, Shaw-Ben Shi, Byron Eugene Todd, III
  • Patent number: 8738608
    Abstract: A database access model and storage structure that efficiently support concurrent OLTP and OLAP activity independently of the data model or schema used, are described. The storage structure and access model presented avoid the need to design schemas for particular workloads or query patterns and avoid the need to design or implement indexing to support specific queries. Indeed, the access model presented is independent of the database model used and can equally support relational, object and hierarchical models amongst others.
    Type: Grant
    Filed: April 6, 2011
    Date of Patent: May 27, 2014
    Assignee: Justone Database, Inc.
    Inventor: Duncan G. Pauly
  • Patent number: 8732116
    Abstract: List information can be extracted into database tables. A number of fields are independently determined for items in list. A number of database table columns are determined from most common number of list item fields. New fields are determined for items with more fields than database columns. Null fields are inserted into items with fewer fields than database columns. Information from items having the same number of fields as database columns is written to database table rows. Information from each field is written to a corresponding database table column. Streaks of poorly matching cells in a database table row are determined. Streak cells are merged and new cells are determined. Null cells are inserted if number of new cells is less than number of cells in the streak. Information from the new cells is written to the table row and columns that define the streak.
    Type: Grant
    Filed: March 5, 2012
    Date of Patent: May 20, 2014
    Assignee: Google Inc.
    Inventors: Hazem Elmeleegy, Jayant Madhavan, Alon Halevy