Data Extraction, Transformation, and Loading (ETL) Patents (Class 707/602)
  • Patent number: 11302314
    Abstract: Embodiments are directed to organizing conversation information. A tracker vocabulary may be provided to a universal model to predict a generalized vocabulary associated with the tracker vocabulary. A tracker model may be generated based on the portions of the universal model activated by the tracker vocabulary such that a remainder of the universal model may be excluded from the tracker model. Portions of a conversation stream may be provided to the tracker model. A match score may be generated based on the tracker model and the portions of the conversation stream such that the match score predicts if the portions of the conversation stream may be in the generalized vocabulary predicted for the tracker vocabulary. Tracker metrics may be collected based on the portions of the conversation and the match scores such that the tracker metrics may be included in reports or notifications.
    Type: Grant
    Filed: November 10, 2021
    Date of Patent: April 12, 2022
    Assignee: Rammer Technologies, Inc.
    Inventors: Toshish Arun Jawale, Anthony Claudia, Surbhi Rathore
  • Patent number: 11301470
    Abstract: A control method for performing a multi-table join operation and a corresponding apparatus are disclosed. Fields in a table whose fields participating in a theta join operation and used in the query statement meet a first preset condition are decomposed, to obtain a plurality of first field groups, so that the theta join operation can be implemented in steps in a form of the plurality of field groups. This can reduce a data amount of Cartesian product calculation during one join operation, greatly reduce network transmission overheads, computing overheads, and memory overheads, and improve execution efficiency.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: April 12, 2022
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Wenjie Liu, Jiansheng Wei, Xiaofei Zhang, Lei Chen
  • Patent number: 11301476
    Abstract: A method according to embodiments includes: storing an entire hash table of a table R in memory of a computational storage device; storing a second table S in storage media of the computational storage device, the table R being smaller than the table S, wherein the hash JOIN operation is directed to combining a cs-th column in the table S and a cr-th column in the table R; wherein, for each row of the table S, the computational storage device configured to perform a method, including: applying a hash function to a value of the cs-th column to provide a hash result; looking up the hash result in the hash table stored in the memory of the computational storage device; and if the hash result is found, sending the row of the table S and a corresponding row index of the table R to the host computing system.
    Type: Grant
    Filed: July 17, 2019
    Date of Patent: April 12, 2022
    Assignee: SCALEFLUX, INC.
    Inventors: Tong Zhang, Yang Liu, Fei Sun, Hao Zhong
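The hash JOIN flow in the abstract above can be sketched roughly as follows. This is a minimal single-process illustration, not the patented implementation: the function names (`build_hash_table`, `device_side_scan`, `host_side_join`) are hypothetical, Python's built-in `hash` stands in for the device's hash function, and the host-side equality check that filters hash collisions is an assumption added for correctness.

```python
from typing import Any, Dict, Iterator, List, Tuple

Row = Tuple[Any, ...]


def build_hash_table(table_r: List[Row], cr: int) -> Dict[int, List[int]]:
    """Map hash(value in column cr) to the indices of matching rows in R."""
    table: Dict[int, List[int]] = {}
    for idx, row in enumerate(table_r):
        table.setdefault(hash(row[cr]), []).append(idx)
    return table


def device_side_scan(
    table_s: List[Row], cs: int, hash_table: Dict[int, List[int]]
) -> Iterator[Tuple[Row, int]]:
    """Simulates the storage device: hash column cs of each S row and,
    on a hit, emit the S row together with the R row index."""
    for row in table_s:
        for r_idx in hash_table.get(hash(row[cs]), ()):
            yield row, r_idx


def host_side_join(
    candidates: Iterator[Tuple[Row, int]], table_r: List[Row], cs: int, cr: int
) -> List[Row]:
    """Simulates the host: fetch the R row by index and confirm the values
    really are equal (assumed step, guards against hash collisions)."""
    joined = []
    for s_row, r_idx in candidates:
        r_row = table_r[r_idx]
        if s_row[cs] == r_row[cr]:
            joined.append(s_row + r_row)
    return joined
```

Only rows of S that hit the hash table cross the device boundary, which is the point of pushing the probe phase down to storage.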
  • Patent number: 11290524
    Abstract: Embodiments are directed to communicating between computing nodes in a cluster of nodes. In one scenario, a computer system receives a data packet from a worker node including the worker node's current workload identifiers and health status, where the data packet includes an associated version number. The computer system determines that the version number in the received data packet is different than a previously received data packet and evaluates the worker node's current workload configuration to determine whether workload changes are to be made on the worker node. Then, upon determining that workload changes are to be made on the worker node, the computer system selects a subset of workload changes to apply to the worker node, generates an indication of the selected subset of workload changes to the worker node and sends the generated indication of workload changes to the worker node.
    Type: Grant
    Filed: August 13, 2014
    Date of Patent: March 29, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Abhinit Kumar, Andrea D'Amato, David Allen Dion, Gor Nishanov, Lokesh Srinivas Koppolu, Nicholas Maliwacki
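The version-number check described in the abstract above might look like the sketch below. The names `Heartbeat`, `Controller`, and `max_changes` are hypothetical, the "subset of workload changes" is assumed to be simply the first few changes in a fixed order, and the health status is carried in the packet but not acted on here.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Tuple


@dataclass(frozen=True)
class Heartbeat:
    node: str
    version: int               # changes whenever the node's state changes
    workloads: FrozenSet[str]  # current workload identifiers
    healthy: bool              # health status (not used in this sketch)


class Controller:
    def __init__(self, desired: Dict[str, FrozenSet[str]], max_changes: int = 2):
        self.desired = desired          # node -> workloads it should run
        self.max_changes = max_changes  # size of the subset sent per round
        self.last_version: Dict[str, int] = {}

    def on_heartbeat(self, hb: Heartbeat) -> List[Tuple[str, str]]:
        """Return the subset of workload changes to send to the node."""
        # Same version as the previous packet: nothing to re-evaluate.
        if self.last_version.get(hb.node) == hb.version:
            return []
        self.last_version[hb.node] = hb.version

        want = self.desired.get(hb.node, frozenset())
        changes = [("stop", w) for w in sorted(hb.workloads - want)]
        changes += [("start", w) for w in sorted(want - hb.workloads)]
        return changes[: self.max_changes]
```

Skipping packets whose version is unchanged keeps the controller from re-evaluating the workload configuration on every heartbeat.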
  • Patent number: 11288269
    Abstract: Examples of a process optimization system are provided. The system may obtain a query from a user and process data associated with the query from a plurality of data sources. The system may identify a plurality of factors and a target variable from the process data. The system may identify a factor range for the plurality of factors. The system may deconstruct the factor range to identify a plurality of data partitions. The system may identify a data pruning activator based on the plurality of data partitions and a preponderant data partition therefrom. The system may identify a plurality of clusters associated with the preponderant data partition. The system may identify a preponderant cluster from the plurality of clusters. The system may identify a confidence score associated with the preponderant cluster. The system may generate a process optimization result based on the preponderant cluster and the confidence score.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: March 29, 2022
    Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED
    Inventors: Gurpreet Singh Bawa, Kaustav Pakira, Tejas Anant Wagh
  • Patent number: 11288241
    Abstract: Methods and systems for determining relationships between two or more nominally unrelated data sources utilizing a combination of probabilistic modeling and graphical clustering are described. The systems and methods utilize probabilistic model functions to determine and judge the likelihood that two records from different systems are related to the same entity.
    Type: Grant
    Filed: September 24, 2019
    Date of Patent: March 29, 2022
    Assignee: MASSACHUSETTS MUTUAL LIFE INSURANCE COMPANY
    Inventors: Sears Merritt, Thom Neale
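One common way to combine probabilistic matching with graph clustering is sketched below. The abstract above does not specify the model, so this is an assumption-heavy illustration: the per-field probabilities in `FIELD_PROBS`, the prior, and the threshold are made-up values, the scoring is a simple naive-Bayes log-odds sum, and clustering is done with connected components via union-find.

```python
import math
from itertools import combinations
from typing import Dict, List

# Hypothetical per-field probabilities:
#   m = P(field agrees | same entity), u = P(field agrees | different entities)
FIELD_PROBS = {"name": (0.95, 0.01), "dob": (0.98, 0.003), "zip": (0.90, 0.05)}


def match_probability(a: Dict[str, str], b: Dict[str, str], prior: float = 0.01) -> float:
    """Posterior probability that records a and b refer to the same entity."""
    log_odds = math.log(prior / (1 - prior))
    for field, (m, u) in FIELD_PROBS.items():
        if a.get(field) == b.get(field):
            log_odds += math.log(m / u)
        else:
            log_odds += math.log((1 - m) / (1 - u))
    return 1 / (1 + math.exp(-log_odds))


def cluster(records: List[Dict[str, str]], threshold: float = 0.5) -> List[List[int]]:
    """Link pairs whose match probability clears the threshold, then
    return connected components as lists of record indices."""
    parent = list(range(len(records)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if match_probability(records[i], records[j]) >= threshold:
            parent[find(i)] = find(j)

    groups: Dict[int, List[int]] = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

The all-pairs comparison is quadratic; a real system would normally block or index candidate pairs first.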
  • Patent number: 11275755
    Abstract: Automatically extracting lineage data for distributed file systems and databases by generating a file folder including data and having a file path, generating a data table associated with the data and the file path, tracking the lineage of data into the file folder and the lineage of data from the data table, and recording an overall lineage comprising the lineage of data into the file folder and from the data table.
    Type: Grant
    Filed: October 7, 2019
    Date of Patent: March 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Lukasz Jakub Palus, Grzegorz Piotr Szczepanik, Krzysztof Rudek, Michal Bodziony
  • Patent number: 11277265
    Abstract: An online host method to verify hosted information associated with a user includes establishing, by the online host serving as a relying party system (RPS), a secure connection between the RPS and a user mobile-identification-credential device (UMD). The RPS sends a mobile identification credential (MIC) user information request to the UMD, via the secure connection, seeking release of MIC user information (official information). The RPS obtains from an authorizing party system (APS) verification of the MIC user information received in response to the MIC user information request. The RPS stores the MIC user information as hosted information pertaining to the user. The RPS designates the hosted information as base truth information representing the user.
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: March 15, 2022
    Assignee: The Government of the United States of America, as represented by the Secretary of Homeland Security
    Inventors: Daniel A. Boyd, Kelli L. Biegger, Chang Ellison, Brandon P. Gutierrez, Jason Lim, William Washington
  • Patent number: 11268732
    Abstract: A building energy management system includes building equipment, a data collector, an analytics service, a timeseries database, and an energy management application. The building equipment monitors and controls one or more variables in the building energy management system and provides data samples of the one or more variables. The data collector collects the data samples from the building equipment and generates a data timeseries including a plurality of the data samples. The analytics service performs one or more analytics using the data timeseries and generates a results timeseries including a plurality of result samples indicating results of the analytics. The timeseries database stores the data timeseries and the results timeseries. The energy management application retrieves the data timeseries and the results timeseries from the timeseries database in response to a request for timeseries data associated with the one or more variables.
    Type: Grant
    Filed: December 27, 2019
    Date of Patent: March 8, 2022
    Assignee: Johnson Controls Technology Company
    Inventors: Andrew J. Boettcher, Steven R. Vitullo, Vivek Narain, Youngchoon Park, Gerald A. Asp, Peter A. Craig, Vijaya S. Chennupati
  • Patent number: 11259166
    Abstract: A method and apparatus for recommending that a user dial an emergency number to report an incident is provided herein. During operation, a user submits anonymous information about an incident to a tipping server. The tipping server determines if the incident that the user is reporting is unreported to an emergency number. If the incident remains unreported to an emergency number, the user is provided a suggestion to call an emergency number to report the incident.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: February 22, 2022
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventors: Bing Qin Lim, Chong Keat Chua, Wei Hong Tew, Zhe Qian Goh
  • Patent number: 11250128
    Abstract: A system includes a source code repository which stores source code entries, which include instructions in a programming language for performing computing tasks. A style repository stores a style profile for a plurality of users. Each style profile includes predefined style features associated with formatting characteristics of the stored source code entries for a corresponding user. A source code analyzer receives, from a user, a source code which includes instructions in the programming language for performing a computing task. Style features of the source code are determined. The style features include characteristics of a format of the source code. The source code analyzer determines whether the style features correspond to predefined style features indicated by a style profile of the user. If this is the case, the source code is stored in the source code repository. If this is not the case, storage of the source code is prevented.
    Type: Grant
    Filed: February 18, 2020
    Date of Patent: February 15, 2022
    Assignee: Bank of America Corporation
    Inventor: Vidhya Vaithiyanathan
  • Patent number: 11245704
    Abstract: Aspects of the disclosure relate to account lineage tracking and automatically executing responsive actions upon detecting an account lineage. A computing platform may receive a first account-change message from a source-level interceptor. The first account-change message may include information identifying a source account associated with a first computing device and identifying a first target account. The first target account may be associated with a target application configured to access the target database. The computing platform may receive a second account-change message from a database-level interceptor. The second account-change message may include information identifying the first target account as a database-level source account and identifying a second target account associated with one or more target databases.
    Type: Grant
    Filed: January 8, 2020
    Date of Patent: February 8, 2022
    Assignee: Bank of America Corporation
    Inventors: George Albero, Edward Lee Traywick, Scot L. Daniels
  • Patent number: 11232125
    Abstract: In accordance with various embodiments of the present disclosure, a query for information related to machine data generated by one or more machine data sources of a cloud computing platform (CCP) is sent by a client computing device and to a cloud computing monitoring component of the CCP, where the query is formed using native query language of the CCP. As a result, the client computing device via a connector receives a first data object that is formatted in accordance with a first format associated with the CCP. The client computing device via the connector may then convert the first data object to one or more second data objects formatted in accordance with a second format that allows for enhanced ingestion by a data intake and query system.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: January 25, 2022
    Assignee: Splunk Inc.
    Inventors: Ujwala Khante, Daniel See, Nicholas Tankersley, Po Hsin Wang
  • Patent number: 11228598
    Abstract: An offline mode user account authorization method includes obtaining an access privilege of a user account and setting an offline task access privilege of the user account according to the access privilege, retrieving user account information corresponding to a task to be executed from a preset task information comparison table when the task to be executed is implemented, determining whether the user account is online, executing the task to be executed with a user ID of the user account if the user account is online, and creating a super offline user account and assigning offline task access privilege of the user account to the super offline user account if the user account is not online. The implemented task to be executed is executed with the super offline user account.
    Type: Grant
    Filed: July 5, 2019
    Date of Patent: January 18, 2022
    Assignees: Fu Tai Hua Industry (Shenzhen) Co., Ltd., HON HAI PRECISION INDUSTRY CO., LTD.
    Inventors: Peng-Fei Guo, Ze Yao, Yue-Ping Zhou, Shou-Feng Fan, Fu-Fa Cai, Hui-Feng Liu, Xin Lu, Chuan Xiao
  • Patent number: 11227234
    Abstract: A normalized driver performance comparison framework can include predicting a plurality of scenarios of a ground vehicle. A Monte Carlo simulation can be utilized to determine results using the predicted plurality of scenarios and one or more predictive models. Best and worst possible scenarios and associated efficiency for an observed scenario can be determined based on the Monte Carlo simulation results. The efficiency for the observed scenario can be normalized as a function of the best and worst efficiencies.
    Type: Grant
    Filed: June 11, 2019
    Date of Patent: January 18, 2022
    Assignee: Traxen Inc.
    Inventors: Saeed Zamanzad Gavidel, Gerti Tuzi, Syed Ali, Ali Maleki
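The normalization step in the abstract above can be illustrated with a small sketch. Everything concrete here is an assumption: `predicted_efficiency` is a toy stand-in for the predictive models, the scenario ranges are invented, and min-max scaling is just one plausible "function of the best and worst efficiencies".

```python
import random
from typing import List


def predicted_efficiency(speed_kph: float, grade_pct: float) -> float:
    """Hypothetical predictive model: fuel efficiency in km/L."""
    return 4.0 - 0.02 * abs(speed_kph - 80.0) - 0.3 * max(grade_pct, 0.0)


def simulate(n: int, seed: int = 0) -> List[float]:
    """Monte Carlo: sample n scenarios and predict the efficiency of each."""
    rng = random.Random(seed)
    return [
        predicted_efficiency(rng.uniform(60.0, 110.0), rng.uniform(-2.0, 4.0))
        for _ in range(n)
    ]


def normalized_score(observed: float, simulated: List[float]) -> float:
    """Scale the observed efficiency to [0, 1] between the worst and best
    simulated efficiencies (min-max scaling, an assumed choice)."""
    best, worst = max(simulated), min(simulated)
    if best == worst:
        return 1.0
    score = (observed - worst) / (best - worst)
    return min(1.0, max(0.0, score))
```

Normalizing against simulated best and worst cases lets drivers on different routes be compared on the same 0 to 1 scale.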
  • Patent number: 11216454
    Abstract: A parallel database system processes database queries invoking a user defined function (UDF). The database query is specified in a database query language, for example, the structured query language (SQL) and the UDF is specified in a programming language different from the database query language, for example, C/C++. The UDF may include multiple phases. The parallel database system generates optimized code for the UDF. The parallel database system includes a leader node and a plurality of compute nodes. The database query processes one or more database tables such that each compute node processes a slice of each database table. The leader node sets up a global context structure and provides it to each compute node. The global context may be shared across multiple executions of the query. The UDF may be used for efficient execution of extraction, transformation, and loading of data from external systems.
    Type: Grant
    Filed: January 21, 2017
    Date of Patent: January 4, 2022
    Assignee: ACTIAN SUB III, INC.
    Inventors: Richard L. Cole, Yijou Chen, Eugene Szedenits, Jr.
  • Patent number: 11216447
    Abstract: Aspects described herein generally improve the quality, efficiency, and speed of data processing systems by generating staging data independently from the execution of control scripts which process the staging data. The staging data can be independently loaded, validated, and utilized across multiple control scripts, reducing redundancy in the loading of data and the overhead of executing separate data processing for each control script. The control scripts can be automatically validated, such as by verifying expected output data ranges. Additionally, the complexity of the control scripts can be reduced as the loading of data is not performed by the control scripts. The control scripts can generate a variety of output data, such as an indication of impacted accounts, and provide notifications based on the output data. A variety of machine learning classifiers can be used to automatically generate the staging data and validate the staging data and/or output data.
    Type: Grant
    Filed: February 4, 2020
    Date of Patent: January 4, 2022
    Assignee: Capital One Services, LLC
    Inventors: Daniel Gunn, Zhihao Gao, Pulkit Gulati, William Cartar
  • Patent number: 11212260
    Abstract: A method for dynamic firewall configuration for accessing service hosted in virtual networks includes monitoring, in a virtual network, changes in an Internet protocol (IP) address of a service hosted in a virtual network. The method further includes detecting a change in the IP address of the service hosted in the virtual network. The method further includes communicating notification of the change in IP address to a firewall policy management interface. The method further includes automatically configuring a firewall to allow access to the service hosted in the virtual network.
    Type: Grant
    Filed: March 24, 2018
    Date of Patent: December 28, 2021
    Assignee: KEYSIGHT TECHNOLOGIES, INC.
    Inventors: Kristopher Len Raney, Winston Wencheng Liu, Matthew R. Bergeron
  • Patent number: 11210294
    Abstract: A method for searching for rules of association in a database containing a list of instances exhibiting a set of real numerical values taken by a predetermined number of variables, including: selecting a set of NI input variables from among the variables of the list of instances, the input variables defining a space of dimension NI, and selecting an output variable from among the remaining variables; ordering, for each input variable of the selected list of instances, the numerical values of the instances for this variable, each instance then being defined by a set of ranks and represented by a point in the space of dimension NI; defining at least one modality for the selected output variable; exploring, in sub-spaces of the space of dimension NI, right convex regions; and selecting, from among the explored right convex regions, regions of interest.
    Type: Grant
    Filed: April 17, 2018
    Date of Patent: December 28, 2021
    Assignee: Quantics Technologies
    Inventor: Stéphane Martinetti
  • Patent number: 11200236
    Abstract: A system includes reception of a filter on a first dimension of a first data model, determination of an association between a second dimension of the first data model and a first dimension of a second data model, generation of a calculated dimension based on the second dimension, the rows of the calculated dimension being identical to the rows of the first dimension, application of the filter to the rows of the calculated dimension to generate filtered rows of the calculated dimension, execution of an inner join between the filtered rows of the calculated dimension and a fact table of the second data model to generate a result table including one or more measure values of the second data model, and generation of a visualization including the result table.
    Type: Grant
    Filed: September 6, 2019
    Date of Patent: December 14, 2021
    Assignee: BUSINESS OBJECTS SOFTWARE LTD.
    Inventors: Ping Xiang, Semuel Kadarusman, Patrick Wang, Justin Wong, Veljko Jovanovic
  • Patent number: 11200131
    Abstract: A method includes monitoring a job being executed at a source mainframe, the job comprising multiple tasks. The method includes monitoring a particular task of the multiple tasks being executed at the source mainframe and determining an application required to execute the particular task. In response to determining that the particular task requires an application to execute, the method determines a target mainframe where the application is installed. The method further includes validating the environment of the target mainframe to confirm that the particular task can be executed using the target mainframe, and upon validating the target mainframe, redirecting the particular task to the target mainframe for execution. The method also includes monitoring the particular task being executed at the target mainframe and returning the results of the particular task from the target mainframe to the source mainframe.
    Type: Grant
    Filed: November 12, 2019
    Date of Patent: December 14, 2021
    Assignee: VIRTUALZ COMPUTING CORPORATION
    Inventors: Jeanne M. Glass, Dustin W. Froyum, Vincent R. Re
  • Patent number: 11194772
    Abstract: A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
    Type: Grant
    Filed: January 31, 2017
    Date of Patent: December 7, 2021
    Assignee: International Business Machines Corporation
    Inventors: Neil E. Bartlett, Craig A. Statchuk
  • Patent number: 11194845
    Abstract: Embodiments are directed to managing a data model that includes data objects associated with layers of the data model. A lineage model that includes one or more lineage objects associated with a layer may be provided. A lineage object associated with a layer may be activated. A first portion of data objects may be determined based on the activated lineage object. An anchor data object may be determined from the first portion of data objects. A search of the data model may be employed to provide dependency information based on a position of the anchor data object in the data model. A second portion of data objects may be determined based on the dependency information. A report that includes a list of the second portion of data objects, the dependency information, or the like, may be provided.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: December 7, 2021
    Assignee: Tableau Software, LLC
    Inventors: Tyler James-Buker Doyle, Graeme Riley Britz, Jonathan Joseph Bryak, Cassius David Butcher, Jun Ho Kim, Jiyoung Oh
  • Patent number: 11182404
    Abstract: The present invention contemplates the use of Change Data Capture (CDC) between a source database and a target database. To execute CDC, one or more computer processors execute the steps of obtaining a plurality of log records comprising information on transactions processed in the source database, grouping the plurality of log records into a plurality of groups based on a predetermined condition, and determining that the plurality of obtained log records is to be replicated from the source database to the target database in parallel in a unit of a group.
    Type: Grant
    Filed: November 9, 2017
    Date of Patent: November 23, 2021
    Inventors: Jeong Il Yoon, Owook Jo
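Grouped, parallel replication of log records as described in the abstract above could be sketched like this. The abstract leaves the "predetermined condition" open, so grouping by table name is an assumption, as are the record layout (`table`, `op`, `pk`, `row`) and the in-memory dictionary that stands in for the target database.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, List

LogRecord = Dict[str, Any]


def group_records(
    log_records: List[LogRecord],
    key: Callable[[LogRecord], Any] = lambda rec: rec["table"],
) -> Dict[Any, List[LogRecord]]:
    """Group log records by the (assumed) condition, keeping log order
    inside each group."""
    groups: Dict[Any, List[LogRecord]] = defaultdict(list)
    for rec in log_records:
        groups[key(rec)].append(rec)
    return dict(groups)


def replicate(log_records: List[LogRecord], target: Dict[str, dict], workers: int = 4) -> None:
    """Apply each group to the target in parallel, one worker per group."""
    groups = group_records(log_records)
    for table in groups:
        target.setdefault(table, {})  # create per-table stores up front

    def apply_group(table: str) -> None:
        rows = target[table]  # each worker touches only its own table
        for rec in groups[table]:
            if rec["op"] == "delete":
                rows.pop(rec["pk"], None)
            else:  # insert or update
                rows[rec["pk"]] = rec["row"]

    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(apply_group, groups))  # list() surfaces worker errors
```

Because order is preserved only within a group, this choice of grouping key gives up cross-table transaction atomicity; a different key would trade parallelism for stronger ordering.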
  • Patent number: 11183176
    Abstract: Systems and methods for e-commerce systems using natural language understanding are described. A computing device is configured to receive at least one application-specific element for a natural language conversation application, at least one shared element, and a user utterance. The system identifies at least one intent based on the at least one application-specific element and the at least one shared element and generates a system utterance based on the at least one application-specific element.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: November 23, 2021
    Assignee: Walmart Apollo, LLC
    Inventors: Snehasish Mukherjee, Haoxuan Chen, Shankara Bhargava Subramanya, Tanmay Peter Kuruvilla
  • Patent number: 11176028
    Abstract: A standard conformance testing system and method for a CIM/E model of a power system and a storage medium include: converting the standards “Power Grid Common Model Description Specification (GB/T 30149)” and “Power Grid Operating Model Data Exchange Specification (DL/T 1380)” into practically operable testing rules, and performing automatic generation and verification of a CIM/E standard model and defect model. The testing system and method can improve the comprehensiveness and standardization of a CIM/E model standard compliance test, improving testing efficiency, promoting the level of standardization according to the CIM/E model in a power system software development process, and real time data exchange and interoperation between respective application systems, effectively supporting the safe and stable operation of a power grid.
    Type: Grant
    Filed: July 4, 2016
    Date of Patent: November 16, 2021
    Assignees: CHINA ELECTRIC POWER RESEARCH INSTITUTE COMPANY LIMITED, STATE GRID CORPORATION OF CHINA
    Inventors: Linpeng Zhang, Yujia Li, Qingbo Yang, Lixin Li, Fangchun Di, Yunhao Huang, Dapeng Li, Lei Tao, Yangchun Hao, Zhenyu Chen
  • Patent number: 11164232
    Abstract: Some aspects of the present disclosure are directed to computerized methods for extracting attributes from product titles. The method may include: retrieving a title associated with a product listing and historical product title data; refining the title; determining at least one tag associated with an attribute; generating, based on the at least one extracted tag and the historical title data, a first combination of one or more attributes; determining whether the title includes at least one plurality of product options, and if so: determining, for each product option in the plurality of product options, a second combination of one or more attributes by removing attributes associated with alternative product options from the first combination; and generating a product identifier based on the second combination; and if the title does not include at least one plurality of product options, generating the product identifier based on the first combination.
    Type: Grant
    Filed: January 15, 2021
    Date of Patent: November 2, 2021
    Assignee: COUPANG CORP.
    Inventors: Joon Shik Hong, Seong Jin Lee, Han Byul Bang
  • Patent number: 11157469
    Abstract: Systems and methods are described herein for performing automated audit balance control (ABC) procedures for data that has moved between data stores within an enterprise or other organization. The systems and methods inspect the data at a target data store and determine the quality of the movement of the data to the target data store based on the inspection. For example, the systems and methods can inspect row or record counts for the data in a data store, aggregate numeric sums within the data at the target data store, perform key data value comparisons between different locations that contained the data, perform full data comparisons between different locations that contained the data, perform checksum comparisons, apply complex business rules, and so on, when determining or verifying the completeness and accuracy of data as it arrives to a temporary or permanent location.
    Type: Grant
    Filed: December 31, 2018
    Date of Patent: October 26, 2021
    Assignee: T-Mobile USA, Inc.
    Inventors: Jeffrey Scott, Venkata Reddy Donthireddy
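Three of the checks listed in the abstract above (row counts, aggregate numeric sums, and checksum comparison) can be sketched in a few lines. The function names and the row layout are hypothetical, and SHA-256 over sorted row representations is an assumed checksum scheme rather than anything the abstract specifies.

```python
import hashlib
from typing import Dict, Sequence, Tuple

Row = Tuple[object, ...]


def checksum(rows: Sequence[Row]) -> str:
    """Order-independent checksum: hash the sorted textual form of each row."""
    digest = hashlib.sha256()
    for text in sorted(repr(tuple(r)) for r in rows):
        digest.update(text.encode("utf-8"))
        digest.update(b"\n")  # separator so row boundaries stay distinct
    return digest.hexdigest()


def audit(source: Sequence[Row], target: Sequence[Row], amount_col: int) -> Dict[str, bool]:
    """Compare source and target data stores; True means the check passed."""
    return {
        "row_count": len(source) == len(target),
        "sum": sum(r[amount_col] for r in source) == sum(r[amount_col] for r in target),
        "checksum": checksum(source) == checksum(target),
    }
```

The cheaper checks catch dropped or duplicated rows, while the checksum also catches a single altered value that happens to leave counts unchanged. Exact equality on sums assumes integer or decimal amounts, not floats.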
  • Patent number: 11157688
    Abstract: Technologies are described herein for displaying one or more visual indicators to assist users in identifying cells of a spreadsheet that are affected by an operation. In embodiments disclosed herein, an application is configured to receive a request to perform an operation on at least one cell of the spreadsheet. The application then identifies cells of the spreadsheet that are affected by the operation. The application then displays a visual indicator to emphasize the affected cells. By providing a timely displayed visual indicator that brings attention to affected cells, the user is able to gain a better perspective of the extent of the operation and how the operation applies to the cells of the spreadsheet.
    Type: Grant
    Filed: March 14, 2014
    Date of Patent: October 26, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Allison Jane Rutherford, Marta Luis Burguete, Samuel Chow Radakovitz, Aaron Lamar Wilson, Uhl Albert
  • Patent number: 11157565
    Abstract: A system includes reception of a database query, the query comprising a first query statement to insert rows of a source table into a target table, generation of a second query statement and a third query statement based on the first query statement, the second query statement to insert rows of a first partition of the source table into the target table and the third query statement to insert rows of a second partition of the source table into the target table, generation of a first query operator executable to perform the second query statement, generation of a second query operator executable to perform the third query statement, transmission of the first executable query operator to a second database server node storing the first partition of the source table, and transmission of the second executable query operator to a third database server node storing the second partition of the source table.
    Type: Grant
    Filed: April 4, 2019
    Date of Patent: October 26, 2021
    Assignee: SAP SE
    Inventors: SungGun Jung, Yuchon Yi, Holger Bischoff
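The statement-splitting idea in the abstract above can be illustrated with SQLite. This is only an approximation: SQLite has no native partitions or server nodes, so a `part` column stands in for the partition and a single in-memory connection stands in for the nodes that would each execute one statement. The helper `split_insert` and the table names are hypothetical.

```python
import sqlite3
from typing import List, Sequence, Tuple


def split_insert(
    target: str, source: str, partition_col: str, partitions: Sequence[int]
) -> List[Tuple[str, Tuple[int]]]:
    """Rewrite one INSERT ... SELECT into one statement per source partition.
    Table and column names are interpolated and must come from trusted input."""
    return [
        (f"INSERT INTO {target} SELECT * FROM {source} WHERE {partition_col} = ?", (p,))
        for p in partitions
    ]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE src (id INTEGER, part INTEGER)")
conn.execute("CREATE TABLE tgt (id INTEGER, part INTEGER)")
conn.executemany("INSERT INTO src VALUES (?, ?)", [(1, 0), (2, 1), (3, 0), (4, 1)])

# In a distributed system each statement would be sent to the node that
# stores the matching partition; here they simply run one after another.
for sql, params in split_insert("tgt", "src", "part", [0, 1]):
    conn.execute(sql, params)

rows = conn.execute("SELECT id, part FROM tgt ORDER BY id").fetchall()
```

Since each generated statement reads exactly one partition, the work can run where the data already lives instead of shipping every source row to a coordinator.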
  • Patent number: 11157777
    Abstract: According to one implementation, a quality control (QC) system for annotated content includes a computing platform having a hardware processor and a system memory storing an annotation culling software code. The hardware processor executes the annotation culling software code to receive multiple content sets annotated by an automated content classification engine, and obtain evaluations of the annotations applied by the automated content classification engine to the content sets. The hardware processor further executes the annotation culling software code to identify a sample size of the content sets for automated QC analysis of the annotations applied by the automated content classification engine, and cull the annotations applied by the automated content classification engine based on the evaluations when the number of annotated content sets equals the identified sample size.
    Type: Grant
    Filed: July 15, 2019
    Date of Patent: October 26, 2021
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Matthew C. Petrillo, Marc Junyent Martin, Anthony M. Accardo, Avner Swerdlow, Monica Alfaro Vendrell
  • Patent number: 11151161
    Abstract: Replication and failover of database data is disclosed. A method includes replicating database data stored in a primary deployment such that the database data is further stored in a secondary deployment. The method includes executing one or more updates to the database data at the secondary deployment when the primary deployment is unavailable and propagating the one or more updates to the primary deployment when the primary deployment becomes available again. The method includes executing queries on the database data at the primary deployment when the primary deployment is available.
    Type: Grant
    Filed: April 23, 2019
    Date of Patent: October 19, 2021
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Eric Robinson, Martin Hentschel
  • Patent number: 11144979
    Abstract: The present disclosure provides a method of automatically extracting data from web pages and analyzing the extracted data to generate an output. A plurality of web pages of a plurality of merchants is accessed. Based on the accessing of the web pages, a subset of the plurality of web pages is identified as inventory pages that contain information about products or services offered for sale. The inventory pages are electronically scanned to extract a price for each of the products or services. An output is generated that includes a listing of the products or services and prices associated with the products or services, respectively.
    Type: Grant
    Filed: April 1, 2020
    Date of Patent: October 12, 2021
    Assignee: PAYPAL, INC.
    Inventors: Olga Sharshevsky, Yael Cohen, Elina Nemtsov
  • Patent number: 11137987
    Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide support for auto-mapping of complex data structures, datasets or entities, between one or more sources or targets of data, referred to herein in some embodiments as HUBs. The auto-mapping can be driven by a metadata, schema, and statistical profiling of a dataset; and used to map a source dataset or entity associated with an input HUB, to a target dataset or entity or vice versa, to produce an output data prepared in a format or organization (projection) for use with one or more output HUBs.
    Type: Grant
    Filed: August 22, 2017
    Date of Patent: October 5, 2021
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Hassan Heidari Namarvar, Alexander Sasha Stojanovic, David Allan, Ganesh Seetharaman
  • Patent number: 11138190
    Abstract: Systems, methods, and devices for generating a materialized view over an external table. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating, by the database platform, a materialized view over the external table. The method includes receiving a notification that a modification has been made to the external table, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes, in response to the external table being modified, refreshing the materialized view such that the materialized view comprises an accurate representation of the external table.
    Type: Grant
    Filed: January 28, 2020
    Date of Patent: October 5, 2021
    Assignee: Snowflake Inc.
    Inventors: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
  • Patent number: 11140036
    Abstract: Methods and systems for identifying groups of related nodes in an integration flow are disclosed.
    Type: Grant
    Filed: January 16, 2019
    Date of Patent: October 5, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Doina L. Klinger, John A. Reeve, Ashley Harrison, Vlad Balanescu
  • Patent number: 11126737
    Abstract: System and method of decentralized services to make federated raw data sets owned by a plurality of Publishers Self-Governing for secure sharing and commingling. The commingled data can be productively used or analyzed by Subscribers only in an indirect manner that prevents extraction by any party and prevents unauthorized use in contradiction to the Self-Governing need-to-know policies defined by each Publisher.
    Type: Grant
    Filed: January 9, 2019
    Date of Patent: September 21, 2021
    Inventor: Randy Friedman
  • Patent number: 11126605
    Abstract: A distributed storage system may store data object instances in persistent storage and may store keymap information for those data object instances in a distributed hash table on multiple computing nodes. Each data object instance may include a composite key containing a user key. The keymap information for each data object instance may map the user key to a locator and the locator to the data object instance. A request to store or retrieve keymap information for a data object instance may be routed to a particular computing node based on a consistent hashing scheme in which a hash function is applied to a portion of the composite key of the data object instance. Thus, related entries may be clustered on the same computing nodes. The portion of the key to which the hash function is applied may include a pre-determined number of bits or be identified using a delimiter.
    Type: Grant
    Filed: March 15, 2019
    Date of Patent: September 21, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Jason G. McHugh, Praveen Kumar Gattu, Michael A. Ten-Pow, Derek Ernest Denny-Brown, II
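The routing idea in the abstract above (hash only a portion of the composite key so that related entries land on the same node) can be sketched in Python as follows. The `Ring` class, the `/` delimiter, the virtual-node count, and the use of SHA-256 are assumptions chosen for illustration, not details from the patent.

```python
import bisect
import hashlib


def _hash(text: str) -> int:
    """Map a string to a 64-bit integer position on the ring."""
    return int.from_bytes(hashlib.sha256(text.encode()).digest()[:8], "big")


class Ring:
    """A small consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes, vnodes=64):
        points = sorted((_hash(f"{n}#{i}"), n) for n in nodes for i in range(vnodes))
        self._hashes = [h for h, _ in points]
        self._owners = [n for _, n in points]

    def node_for(self, composite_key: str, delimiter: str = "/") -> str:
        # Hash only the part of the key before the delimiter (the user key),
        # so every entry sharing that prefix is routed to the same node.
        prefix = composite_key.split(delimiter, 1)[0]
        idx = bisect.bisect_right(self._hashes, _hash(prefix)) % len(self._hashes)
        return self._owners[idx]


ring = Ring(["node-a", "node-b", "node-c"])
print(ring.node_for("photos/v1") == ring.node_for("photos/v2"))
```

Because only the prefix is hashed, `photos/v1` and `photos/v2` always resolve to the same node, while adding or removing a node moves only the keys adjacent to its ring positions.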
  • Patent number: 11106760
    Abstract: Enhanced metadata with optimized output. In an embodiment, a content object is received. First metadata to be associated with the content object is determined. At least one metadata field to be acquired is determined based on an association, within a stored knowledge structure, of the at least one metadata field with the first metadata. Second metadata to be associated with the content object is acquired based on the at least one metadata field. A metadata structure and markup format for the content object are determined. The metadata structure incorporates both the first metadata and the second metadata. The content object is output with the metadata structure and in the markup format.
    Type: Grant
    Filed: April 29, 2016
    Date of Patent: August 31, 2021
    Assignee: EINSTEIN INDUSTRIES, INC.
    Inventors: Robert Silkey, Ted Ricasa, Sergiy Zubatiy, Jeremy Michael Hawkins, Christopher Cherry
  • Patent number: 11086809
    Abstract: Data transfer acceleration includes receiving, by a data transfer accelerator in a first node of a plurality of nodes, from a second node of the plurality of nodes, a request for data in a second state, wherein the second node stores an instance of the data in a first state; generating a message including one or more operations to transform the data from the first state to the second state; and sending the message to the second node in response to the request.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: August 10, 2021
    Assignee: ADVANCED MICRO DEVICES, INC.
    Inventor: Anthony Gutierrez
  • Patent number: 11080113
    Abstract: A FIFO queue service of a provider network allows clients to replicate a FIFO queue to a secondary backup queue of another region. A local instance of the queue service receives and stores send/receive/delete transactions in an order. The service instance applies the transactions to a primary FIFO queue and replicates only the send requests and delete requests to a secondary FIFO queue of a remote instance of the FIFO queue service (e.g., at another region). The remote instance determines, based on ordering metadata of a replicated request, that the replicated request can be stored in accordance with the ordering metadata (e.g., the replicated request depends on another request that has also been received/replicated). In response, the remote secondary instance stores and applies the replicated request to a secondary FIFO queue.
    Type: Grant
    Filed: December 11, 2020
    Date of Patent: August 3, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Stefan Mircea Onofrei
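One way to picture the remote side of the replication described above is a buffer that holds replicated requests until every request they depend on has arrived. The sketch below is an assumption-laden illustration: the `SecondaryQueue` class, the use of a plain sequence number as ordering metadata, and the `send`/`delete` operation names are hypothetical and not taken from the patent.

```python
from collections import deque


class SecondaryQueue:
    """Applies replicated send/delete requests strictly in sequence order."""

    def __init__(self):
        self.messages = deque()  # (message_id, body) pairs in FIFO order
        self._pending = {}       # sequence number -> request waiting on predecessors
        self._next_seq = 1       # next sequence number that may be applied

    def replicate(self, seq, op, message_id, body=None):
        # Hold the request until all lower sequence numbers have been applied.
        self._pending[seq] = (op, message_id, body)
        while self._next_seq in self._pending:
            kind, mid, payload = self._pending.pop(self._next_seq)
            if kind == "send":
                self.messages.append((mid, payload))
            elif kind == "delete":
                self.messages = deque(m for m in self.messages if m[0] != mid)
            self._next_seq += 1


q = SecondaryQueue()
q.replicate(2, "send", "m2", "second")  # arrives early, so it is buffered
q.replicate(1, "send", "m1", "first")   # unblocks both requests
q.replicate(3, "delete", "m1")
print(list(q.messages))
```

Receive requests are not modeled, in line with the abstract's statement that only send and delete requests are replicated.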
  • Patent number: 11073976
    Abstract: A building energy management system includes building equipment, one or more data platform services, a timeseries database, and an energy management application. The building equipment operates to monitor and control a variable and provide raw data samples of a data point associated with the variable. The timeseries database stores a plurality of timeseries associated with the data point. The plurality of timeseries include a timeseries of the raw data samples and the one or more optimized data timeseries generated by the data platform services based on the raw data timeseries. The energy management application generates an ad hoc dashboard including a widget and associates the widget with the data point. The widget displays a graphical visualization of the plurality of timeseries associated with the data point and includes interactive user interface options for switching between the plurality of timeseries associated with the data point.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: July 27, 2021
    Assignee: Johnson Controls Technology Company
    Inventors: Youngchoon Park, Vijaya S. Chennupati, Sudhi Sinha, Justin Ploegert
  • Patent number: 11061880
    Abstract: A system is provided including a memory in communication with a processor. The memory is to store a value of an attribute. The processor is to obtain from a source data structure the value. The processor is also to store in a further data structure the value in association with a descriptor identifying the source data structure. Moreover, the processor is to store in the further data structure an identifier in association with the value. The identifier is incrementable and unique to the value among values associated with the descriptor. In addition, the processor is to output the further data structure.
    Type: Grant
    Filed: January 25, 2018
    Date of Patent: July 13, 2021
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Kevin Williams, Amit Kumar Singh, Gaurav Roy
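The "further data structure" in the abstract above can be read as a lookup table that records each value alongside a descriptor naming its source and an incrementing identifier that is unique per descriptor. The Python sketch below follows that reading; the function name `build_lookup`, the tuple layout, and the choice to skip repeated values are assumptions for illustration only.

```python
from collections import defaultdict
from itertools import count


def build_lookup(sources: dict[str, list[str]]) -> list[tuple[str, int, str]]:
    """Return (descriptor, identifier, value) rows, numbering values per source."""
    counters = defaultdict(lambda: count(1))  # one incrementing counter per descriptor
    seen = set()
    rows = []
    for descriptor, values in sources.items():
        for value in values:
            if (descriptor, value) in seen:
                continue  # assumption: a repeated value keeps its first identifier
            seen.add((descriptor, value))
            rows.append((descriptor, next(counters[descriptor]), value))
    return rows


rows = build_lookup({"orders": ["red", "blue", "red"], "returns": ["red"]})
for row in rows:
    print(row)
```

Identifiers restart at 1 for each descriptor, so the same value may carry different identifiers in different sources while remaining unique within its own source.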
  • Patent number: 11061856
    Abstract: A device automatically extracts a data file from an upstream source based on ingestion parameters. The data file is in a first format that is not readable by a downstream data analysis utility and includes a plurality of data records. Each record includes one or more entries. The ingestion parameters include a file identifier, validation criteria, transformation instructions, and storage instructions for the extracted data file. The device generates table(s) based on the extracted data file by determining, based on the validation criteria, whether each record is permitted to be ingested. The device determines, based on the transformation instructions, a table in which to store each entry and a row and column of the determined table in which to store the entry. Entries are input into the determined tables, rows, and columns. The table(s) are stored in a second format that is readable by the downstream data analysis utility.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: July 13, 2021
    Assignee: Bank of America Corporation
    Inventors: Thiruvadi Natarajan Sundaramoorthy, Kenneth L. Gommel
  • Patent number: 11023484
    Abstract: Embodiments of the present invention disclose generating data profiling jobs for source data in a data processing system, the source data being described by at least one source functional data model. A target functional data model is provided, for describing target data that can be generated from the source data. One or more source functional data models are identified that correspond to the target functional data model. At least one functional source-to-target model mapping is associated to at least one source-target pair based on the target functional data model and identified source functional data models. A physical source-to-target model mapping for at least one source-target pair based on the logical source-to-target model mapping is calculated. For all physical source attributes, the needed data profiling jobs are generated based on the target attribute for analyzing the physical source attributes.
    Type: Grant
    Filed: December 6, 2017
    Date of Patent: June 1, 2021
    Assignee: International Business Machines Corporation
    Inventors: Sebastian Nelke, Martin Oberhofer, Yannick Saillet, Jens P. Seifert
  • Patent number: 11023432
    Abstract: When tenants migrate data from on-premises archiving solutions to a hosted service, tenants should maintain just enough data for compliance purposes and dispose of data that is no longer needed to reduce overall liability and compliance risk exposure. Embodiments are directed to providing selective import of data to a hosted service through a security and compliance system associated with the hosted service to reduce overall liability and compliance risk exposure. Data, usage pattern and security/compliance policies associated with a tenant of the hosted service may be analyzed. A model for importing tenant data may be created based on the analysis. A suggestion may be presented to the tenant based on the model, where the suggestion includes a filter for importing tenant data. In response to receiving a confirmation to implement the suggestion, the filter may be applied to the tenant data as it is imported to the hosted service.
    Type: Grant
    Filed: April 7, 2020
    Date of Patent: June 1, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Churli Su, Rameshkumar Perumal, Liang Huang, Sanjay H. Ramaswamy, Gaston L. Gerchkovich
  • Patent number: 11023483
    Abstract: Embodiments of the present invention disclose generating data profiling jobs for source data in a data processing system, the source data being described by at least one source functional data model. A target functional data model is provided, for describing target data that can be generated from the source data. One or more source functional data models are identified that correspond to the target functional data model. At least one functional source-to-target model mapping is associated to at least one source-target pair based on the target functional data model and identified source functional data models. A physical source-to-target model mapping for at least one source-target pair based on the logical source-to-target model mapping is calculated. For all physical source attributes, the needed data profiling jobs are generated based on the target attribute for analyzing the physical source attributes.
    Type: Grant
    Filed: August 4, 2016
    Date of Patent: June 1, 2021
    Assignee: International Business Machines Corporation
    Inventors: Sebastian Nelke, Martin Oberhofer, Yannick Saillet, Jens P. Seifert
  • Patent number: 11010383
    Abstract: A system and method to receive a query against a plurality of data objects that are sorted, at least conceptually, to follow a hierarchical order and support a preorder traversal arrangement and a postorder traversal arrangement; translate the query into an executable query plan including at least one instance of a hierarchy rearrange operator, the hierarchy rearrange operator to rearrange an input including one or more of the plurality of data objects from one of a preorder traversal arrangement and a postorder traversal arrangement to a corresponding opposite thereof; execute the executable query plan including the at least one instance of the hierarchy rearrange operator; and generate a query result based, at least in part, on the input of the one or more of the plurality of data objects being rearranged in the respective preorder traversal arrangement and a postorder traversal arrangement.
    Type: Grant
    Filed: January 19, 2018
    Date of Patent: May 18, 2021
    Assignee: SAP SE
    Inventors: Norman May, Robert Brunel
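To make the rearrangement in the abstract above concrete, the following Python sketch converts rows given in preorder into postorder using a single stack. It covers only one direction, and it assumes each row carries its depth in the hierarchy; the `(name, level)` row shape and the function name are hypothetical, not the patent's operator.

```python
def preorder_to_postorder(rows):
    """Reorder (name, level) rows from preorder to postorder in one pass."""
    out, stack = [], []
    for name, level in rows:
        # A row at the same or a shallower level closes every open subtree
        # at that depth or deeper, so those nodes can be emitted now.
        while stack and stack[-1][1] >= level:
            out.append(stack.pop())
        stack.append((name, level))
    # Nodes still on the stack are ancestors of the last row; emit deepest first.
    while stack:
        out.append(stack.pop())
    return out


# Hierarchy: A has children B and C; B has child D.
preorder = [("A", 0), ("B", 1), ("D", 2), ("C", 1)]
print([name for name, _ in preorder_to_postorder(preorder)])
```

The pass is linear in the number of rows, and the stack never holds more entries than the depth of the hierarchy.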
  • Patent number: 11010399
    Abstract: One or more embodiments provide for a method. The method includes receiving a document including text. The method also includes receiving different definitions of a position in the document of an identifier-value pair within the text. The identifier-value pair includes an identifier identifying an information type in the text and a value including an entry of the information in the text. The method also includes determining which of the different definitions correctly point to the position to form a number of correct definitions. Each of the different definitions correctly points to the position when the identifier-value pair is found on a first attempt using a definition of the different definitions to search for the identifier-value pair. The method also includes, responsive to the number of correct definitions at least matching a threshold value, issuing a command to perform data scraping on the document to extract the value of the identifier.
    Type: Grant
    Filed: November 28, 2018
    Date of Patent: May 18, 2021
    Assignee: Intuit Inc.
    Inventors: Yair Horesh, Yehezkal Shraga Resheff, Meital Iulia Bojan, Hadar Kedem
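A loose reading of the abstract above is a voting scheme: several independent definitions of where an identifier-value pair sits are tried, and scraping proceeds only if enough of them locate the pair. The Python sketch below models each definition as a regular expression with a `value` group; that modeling choice, the function name `scrape_if_agreed`, and the sample patterns are assumptions for illustration.

```python
import re


def scrape_if_agreed(text, definitions, threshold):
    """Return the extracted value if at least `threshold` definitions find the pair."""
    hits = []
    for pattern in definitions:
        match = re.search(pattern, text)  # a single attempt per definition
        if match:
            hits.append(match.group("value"))
    # Scrape only when enough definitions located the pair.
    return hits[0] if len(hits) >= threshold else None


document = "Invoice A\nTotal: 42.50\n"
definitions = [
    r"Total:\s*(?P<value>[\d.]+)",
    r"Total\s*[:=]\s*(?P<value>[\d.]+)",
    r"Amount due:\s*(?P<value>[\d.]+)",
]
print(scrape_if_agreed(document, definitions, threshold=2))
```

With the sample document, two of the three definitions match, so a threshold of 2 allows extraction and a threshold of 3 does not.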
  • Patent number: 11003783
    Abstract: Disclosed are various embodiments for searchable encrypted data stores. A plurality of rows in a data table that have a cell in a column of the row that matches a value are identified. A bitmap that represents the plurality of rows is generated. The bitmap is encrypted. A key is then generated for a key-value pair. The key can be represented by a message authentication code (MAC) based at least in part on a tuple of a name of the data table, an identifier of the column, and the value. The key-value pair with the key and the encrypted bitmap is then sent to a remote computing device for storage.
    Type: Grant
    Filed: September 21, 2018
    Date of Patent: May 11, 2021
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Cristi Ursachi, Amanda Gray, Priti Marappan, Dan Mutescu
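Two steps from the abstract above, building the row bitmap and deriving the MAC-based lookup key, can be sketched with the Python standard library. Encryption of the bitmap is deliberately left out, since it would need a vetted cipher library. The function names, the use of HMAC-SHA256, and the unit-separator encoding of the tuple are assumptions for illustration.

```python
import hashlib
import hmac


def row_bitmap(rows, column, value):
    """Set bit i when row i holds `value` in `column`."""
    bits = 0
    for i, row in enumerate(rows):
        if row.get(column) == value:
            bits |= 1 << i
    return bits


def index_key(mac_key: bytes, table: str, column: str, value: str) -> str:
    """Derive an opaque lookup key from the (table, column, value) tuple."""
    # Assumption: fields never contain the unit separator used to join them.
    message = "\x1f".join((table, column, value)).encode()
    return hmac.new(mac_key, message, hashlib.sha256).hexdigest()


rows = [{"city": "Oslo"}, {"city": "Rome"}, {"city": "Oslo"}]
bitmap = row_bitmap(rows, "city", "Oslo")
key = index_key(b"demo-secret", "customers", "city", "Oslo")
print(bin(bitmap), key[:16])
```

The remote store sees only the MAC output and (in a full implementation) the encrypted bitmap, so it can serve lookups without learning the table name, column, or value being searched.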