From Unstructured Or Semi-structured Data To Structured Data Patents (Class 707/811)
  • Patent number: 11797526
    Abstract: It may be desired to query a large record of events to retrieve certain segments of the data that provide information of interest. However, the data might not be structured such that it is conducive to generate/execute queries that inquire about net changes of events over time. Query language may also be interpreted differently by different analysts. In some embodiments data related to events is assembled into a data structure having a format logically equivalent to a double entry transition table. Each event may have two entries in the table: a first entry representing the event as a transition from a previous state to a current state, and a second entry representing the reverse transition of the first entry. In some embodiments, a query executor tool may be provided as a graphical user interface for entering information of interest to be queried.
    Type: Grant
    Filed: September 16, 2021
    Date of Patent: October 24, 2023
    Assignee: SHOPIFY INC.
    Inventor: Benjamin Alfred Lucas Doyle
  • Patent number: 11645464
    Abstract: Systems, computer-implemented methods, and computer program products to transform a lexicon that describes an information asset are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a term validation component that can determine from a subject matter expert, a validated term that can indicate validation of a candidate term that describes an information asset. The computer executable components can further comprise a lexicon transforming component that, based on the validated term, can transform a lexicon that describes the information asset, by incorporating the validated term into the lexicon.
    Type: Grant
    Filed: March 18, 2021
    Date of Patent: May 9, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Anna Lisa Gentile, Chad Eric DeLuca, Petar Ristoski, Ismini Lourentzou, Linda Ha Kato, Alfredo Alba, Daniel Gruhl, Steven R. Welch
  • Patent number: 11538594
    Abstract: Implementations provide a method to consolidate data records of regimens for treating oncology conditions. The method includes: accessing data records each encoding multi-tier data characteristics of a regimen for treating a particular oncology condition; receiving a first data record encoding a first regimen specific to a first healthcare provider institution; parsing the first data record according to a hierarchy of the encoded multi-tier data characteristics; distributing a respective weight to each of the encoded data characteristics to account for the potentially missing data characteristic; comparing data characteristics of the first data record with data characteristics from the data records by applying the respective weight to each data characteristic at a particular tier of the hierarchy such that a respective compound score is generated for each data record; and based on the compound scores for all data records, determining a prevailing data record of regimen as matching the first data record.
    Type: Grant
    Filed: October 19, 2020
    Date of Patent: December 27, 2022
    Assignee: IQVIA Inc.
    Inventors: Sanam Sikander, Edmund Drage
  • Patent number: 11494786
    Abstract: A third party item listing management system usable for verification of third party items to be included on a retailer website includes an application programming interface and an item verification pipeline. The application programming interface is accessible by a plurality of third parties and is configured to receive item data associated with one or more items. The item verification pipeline is configured to receive the item data and call an item validation pipeline, the item validation pipeline includes a plurality of item validation stages including a field verification. At the field verification stage, data is extracted from at least one of an item image or text associated with the item. The data extracted is compared to item data obtained from an independent verification source to confirm the accuracy of the item data provided by the third party.
    Type: Grant
    Filed: February 7, 2020
    Date of Patent: November 8, 2022
    Assignee: Target Brands, Inc.
    Inventors: Dakota Reese Brown, Jorge A. Trujillo, Andrew Boerema
  • Patent number: 11416291
    Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.
    Type: Grant
    Filed: July 12, 2021
    Date of Patent: August 16, 2022
    Assignee: Metacluster LT, UAB
    Inventors: Eivydas Vilcinskas, Arnas Petruskevicius, Giedrius Stalioraitis, Martynas Juravicius, Rimantas Stankevicius
  • Patent number: 11416466
    Abstract: Disclosed are system and methods for processing and storing data files, using a data edge file format. The data edge file separates information about what symbols are in a data file and information about the corresponding location of those symbols in the data file. The described technique for converting a source file comprising symbols into a data edge file includes: generating a locality file of symbol location from the source file to identify locations of the symbols in the source file, generating a symbol file to identify symbols in the source file, and then modifying the locality file of symbol location to associate each symbol from the symbol file with a location in the source file.
    Type: Grant
    Filed: June 1, 2018
    Date of Patent: August 16, 2022
    Assignee: CHAOSSEARCH, INC.
    Inventors: Thomas Hazel, David Noblet, Eric Mann, Grant Mills
  • Patent number: 11386063
    Abstract: Disclosed are system and methods for processing and storing data files, using a data edge file format. The data edge file separates information about what symbols are in a data file and information about the corresponding location of those symbols in the data file. The described technique for converting a source file comprising symbols into a data edge file includes: generating a locality file of symbol location from the source file to identify locations of the symbols in the source file, generating a symbol file to identify symbols in the source file, and then modifying the locality file of symbol location to associate each symbol from the symbol file with a location in the source file.
    Type: Grant
    Filed: May 20, 2021
    Date of Patent: July 12, 2022
    Assignee: CHAOSSEARCH, INC.
    Inventors: Thomas Hazel, David Noblet, Eric Mann, Grant Mills
  • Patent number: 11314930
    Abstract: Implementations described herein determine, for a given document generated by a given source, one or more portions of content (e.g., phrase(s), image(s), paragraph(s), etc.) of the given document that may be influenced by a source perspective of the given source. Further, implementations determine one or more additional resources that are related to the given source and that are related to the portion(s) of content of the given document. Yet further, implementations utilize the additional resource(s) to determine additional content that provides context for the portion(s) that may be influenced by a source perspective. A relationship, between the additional resource(s) and the portions of the given document, can be defined. Based on the relationship being defined, the additional content can be caused to be rendered at a client device in response to the client device accessing the given document.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: April 26, 2022
    Assignee: GOOGLE LLC
    Inventors: Victor Carbune, Thomas Deselaers
  • Patent number: 11237752
    Abstract: An embodiment includes locating source data associated with a data replication request, and identifying a property of a data value of the source data using a reflection process. The embodiment determines a meta-type for the data value using an automaton that identifies a matching extent class from among an indexed set of extent classes that define respective accessors that are invoked to locate an extent class that corresponds with the property of the data value. The embodiment generates target data having the data value and a data type that corresponds with the matching extent class. The embodiment generates a changes feed that includes the generated target data, evaluates data in the changes feed to determine whether a specified condition has been satisfied and, if so, generates a trigger signal that initiates an event, where the trigger signal comprising information that satisfied the specified condition.
    Type: Grant
    Filed: April 21, 2020
    Date of Patent: February 1, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dirk Alexander Seelemann, II, Michael Jory
  • Patent number: 11232365
    Abstract: A digital assistant platform may receive a query relating to a target data file. The target data file may be associated with a standardized data set. The standardized data set may include data files that are aggregated based on data elements associated with the data files, and may be configured to enable a plurality of services. The digital assistant platform may extract, based on the query, one or more keywords, and identify an intent classification and an entity associated with the query. The digital assistant platform may analyze the plurality of services to identify a target service to enable, and analyze the data elements to identify the target data file. The digital assistant platform may determine, using the target service, analytical information associated with a data element of the target data file, and generate a response to the query based on the analytical information.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: January 25, 2022
    Assignee: Accenture Global Solutions Limited
    Inventors: Arun Sundararaman, Uday Kumar Ramamoorthy, Sureshkumar Pargunarajan, Sangeetha Appusamy, A. Deni Xavier Gladis
  • Patent number: 11210266
    Abstract: Systems and methods for collection and management of metadata are disclosed. The methods include maintaining a plurality of data storage systems in communication with an external metadata management system, and operating the metadata management system to store metadata corresponding to data residing on the plurality of data storage systems as a plurality of entries. Each of the plurality of entries include one or more facets of data corresponding to metadata stored in that entry and extracted using text analysis of that metadata.
    Type: Grant
    Filed: January 25, 2019
    Date of Patent: December 28, 2021
    Assignee: International Business Machines Corporation
    Inventors: Joseph Dain, Gregory Kishi, Ron Riffe
  • Patent number: 11194828
    Abstract: Disclosed is a system, method, and computer program product for implementing a log analytics method and system that can configure, collect, and analyze log records in an efficient manner. An improved approach has been described to automatically generate a log parser by analysis of the line content of a log. In addition, an efficient approach has been described to extract key-value content from the log content.
    Type: Grant
    Filed: April 15, 2019
    Date of Patent: December 7, 2021
    Assignee: Oracle International Corporation
    Inventor: Gregory Michael Ferrar
  • Patent number: 10719799
    Abstract: A computer-implemented method and system are provided for optimizing resource usage, wherein the resources include employees of an organization. The method includes collecting employee data including structured data and unstructured data through multiple input channels over at least one network and storing the employee data collected over the multiple input channels in at least one computer memory. The method further includes accessing the computer memory using at least one computer processor and executing instructions to perform multiple operations on the stored data. The operations include transforming the unstructured data into structured data and disambiguating the structured data. The operations additionally include applying rule sets to the transformed data and the structured data to derive a digital productivity footprint for each employee and analyzing the derived digital footprints to optimize resource usage.
    Type: Grant
    Filed: March 13, 2014
    Date of Patent: July 21, 2020
    Assignee: JPMORGAN CHASE BANK, N.A.
    Inventors: William V. Harris, Jonathan B. Teplitz, Ganesh Murugan
  • Patent number: 10585926
    Abstract: Embodiments include method, systems and computer program products for managing structuring of large sets of unstructured data. In some embodiments, a search query may be received from a user via a graphical user interface (GUI). The search query may be parsed to identify a data aspect and a first value. An aspect-value pair may be generated using the data aspect and the first value. A data asset may be generated by associating a type structure to the unstructured data comprising a second value, wherein the type structure comprises the data aspect and the second value. A set of search results may be generated using the first value, wherein the set of search results comprises at least one data asset that matches the first value. Presentation of the set of search results may be facilitated, where the set of search results corresponds to the search query and comprises the data aspect.
    Type: Grant
    Filed: June 14, 2016
    Date of Patent: March 10, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Paul K. Bullis, Geoffrey M. Hambrick
  • Patent number: 10521655
    Abstract: Implementations described herein determine, for a given document generated by a given source, one or more portions of content (e.g., phrase(s), image(s), paragraph(s), etc.) of the given document that may be influenced by bias of the given source. Further, implementations determine one or more additional resources that are related to the given source and that are related to the portion(s) of content of the given document. Yet further, implementations utilize the additional resource(s) to determine additional content that provides context for the portion(s) that may be influenced by bias. A relationship, between the additional resource(s) and the portions of the given document, can be defined. Based on the relationship being defined, the additional content can be caused to be rendered at a client device in response to the client device accessing the given document.
    Type: Grant
    Filed: February 11, 2019
    Date of Patent: December 31, 2019
    Assignee: GOOGLE LLC
    Inventors: Victor Carbune, Thomas Deselaers
  • Patent number: 10185755
    Abstract: In one embodiment, a method receives a query for data in a database system and calls a plurality of engines to analyze information for the query. A calculation graph is generated from at least a portion of the plurality of engines where each of the at least a portion of the plurality of engines add a node to the calculation graph based on analyzing of the information for the query. Then, the method executes the query by calling the nodes of the calculation graph. Each node uses metadata added to the node from a respective calculation engine to perform a calculation for the node. Then, a result of the query is output based on the respective calculations performed by the nodes.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: January 22, 2019
    Assignee: Business Objects Software Limited
    Inventors: Saeed Jahankhani, David Mosimann
  • Patent number: 10185557
    Abstract: A change management system generates change records corresponding to changes to tracked documents, and stores a master control file comprising metadata records that respectively correspond to the tracked documents and which comprise a tracked metadata field and a time field. Responsive to detecting that a change to a given document is of a predefined type, the tracked metadata field in the corresponding metadata record is modified, and the time field in that metadata record is updated accordingly. In response to a problem event, it is determined that the problem event relates to the given document, and, based on the time field, a subset of the change records is selected. Each change record in the subset corresponds to the given document. A problem change record is identified from the change records in the subset, and used to revert the given document to a state previous to the problem event.
    Type: Grant
    Filed: October 29, 2015
    Date of Patent: January 22, 2019
    Assignee: CA, Inc.
    Inventors: Edward Alan Goldstein, Arthur Vincent Richards
  • Patent number: 10146863
    Abstract: Item classification rules are created based on examples selected by a user, such as by selecting a subset of emails, and the rule is used across a larger set of items to obtain automatic classification of similar items according to the rule. Based on an analysis, a candidate classification rule is generated identifying text-based features shared among the items of the subset. The user can review the candidate rule as well as a resultant subset of items generated by the rule, and either accept the candidate rule or make an adjustment to the examples and then perform one or more iterations of the analysis to refine the rule. Adjustments can be made by removing items incorrectly included in a resultant subset and/or adding items incorrectly excluded from a resultant subset, and using the adjusted subset in a next iteration.
    Type: Grant
    Filed: November 17, 2014
    Date of Patent: December 4, 2018
    Assignee: GetGo, Inc.
    Inventors: Ashish V. Thapliyal, Anne Marie Lock, Elizabeth Thapliyal, Ryan W. Kasper, Stefan Alexander von Imhof
  • Patent number: 10067992
    Abstract: A method and system are provided for interactive data visualization. The method includes analyzing a data source used with an initial query to identify a set of default categories available for user selection. The method further includes dynamically determining pre-set values for categorical portions and numerical portions of vocabularies for user selection using data analytics on the data source. The method also includes providing the user with a capability to specify other values for the portions. The method additionally includes dynamically generating multiple sub-queries to the data source for the values for the portions, wherein at least one of the multiple sub-queries is dynamically generated for at least one of the other values specified by the user. The method further includes combining result sets for the multiple sub-queries. The method also includes generating a data visualization of the combined result sets and displaying the data visualization.
    Type: Grant
    Filed: July 13, 2016
    Date of Patent: September 4, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Behrooz Ashtiani, Mitra Khatibi, Diana Lau, Jin Li, Joanna W. Ng
  • Patent number: 9460188
    Abstract: A compatibility processing module, for executing one or more processes to format and manipulate data, such that communication between previously-incompatible data warehouses is facilitated. In particular, a first warehouse is disclosed, wherein the first data warehouse is configured with a compatibility processing module, for receiving a large number of data points, and for executing one or more processes on a stored portion of the received data points such that the resulting processed data points are compatible with formatting conventions of a second data warehouse.
    Type: Grant
    Filed: June 3, 2013
    Date of Patent: October 4, 2016
    Assignee: Bank of America Corporation
    Inventors: Bharath Mundlapudi, Karthik Banala, Rajesh Koneru
  • Patent number: 9223817
    Abstract: Virtual repository management is disclosed. An indication is received that a content management functionality is desired to be available with respect to one or more external content items. A reference object is created for each for the one or more external content items that represents the external content item and enables the content management functionality to be performed with respect to the external content item.
    Type: Grant
    Filed: May 20, 2014
    Date of Patent: December 29, 2015
    Assignee: EMC Corporation
    Inventors: Razmik Abnous, Victor Spivak, Eric Merhoff
  • Patent number: 9213733
    Abstract: The present invention provides a system and method that can search social media and Internet websites, and can analyze and display the results according to a variety of criteria including virality on social media websites. The results are presented in a user friendly format such as a magazine, newsletter, newspaper, or scrapbook.
    Type: Grant
    Filed: April 8, 2015
    Date of Patent: December 15, 2015
    Inventors: Luis Sanchez, Ralf Voellmer
  • Patent number: 9009201
    Abstract: Methods, program products, and systems implementing integrated repository of structured and unstructured data are disclosed. An integrated environment is provided for accessing structured data (e.g., data of a relational database) and unstructured data (e.g., data stored in a text or binary file), including creating, managing, modifying, and searching the structured data and unstructured data. The integrated environment can include an integrated user interface, a set of commands and application programming interface (API), and storage for a relational database and a document repository. The integrated environment can include a database abstraction layer that allows database operations on both the structured data and the unstructured data.
    Type: Grant
    Filed: May 13, 2013
    Date of Patent: April 14, 2015
    Assignee: Apple Inc.
    Inventors: Eric Jacobson, Les Thomas, Gang Zhang
  • Patent number: 8996587
    Abstract: Techniques are provided for automatically structuring free form heterogeneous data. In one aspect of the invention, the techniques include obtaining free form heterogeneous data, segmenting the free form heterogeneous data into one or more units, automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information type, and structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units, e.g., information technology (IT) operations.
    Type: Grant
    Filed: February 15, 2007
    Date of Patent: March 31, 2015
    Assignee: International Business Machines Corporation
    Inventors: Gautam Kar, Ruchi Mahindru, Anca Sailer, Xing Wei
  • Patent number: 8977656
    Abstract: A data structure comprises a clump header table and an inline tree data structure. The inline tree, representing filterable data fields of hierarchically organized data records, comprises an alternating sequence of first-level binary string segments, each followed by one or more corresponding second-level binary string segments. Each clump header record includes an indicator of a location in the inline tree of corresponding binary string segments. A dedicated, specifically adapted conversion program generates the clump header file and the inline tree for storage on any computer-readable medium, and the inline tree can be read entirely into RAM to be searched or filtered. A dedicated, specifically adapted search and filter program is employed to list or enumerate retrieved data records. Run-time computer code generation can reduce time required for searching and filtering. One example includes spatial searching and filtering of data records that include spatial coordinates as data fields.
    Type: Grant
    Filed: January 10, 2012
    Date of Patent: March 10, 2015
    Assignee: Moonshadow Mobile, Inc.
    Inventor: Roy W. Ward
  • Publication number: 20150066996
    Abstract: The present invention discloses a method and a system for automatically collecting the publication digital resource, the method comprises: acquiring the resource document in a digital resource of a publication; identifying the resource document according to a preset identifying rule, obtaining an identified result, the identified result comprises: a document type, a document relation and sequencing; uploading the resource document to a server; generating property information of the resource document according to the identified result; storing the property information to a database. The present invention increases the efficiency of collecting the publication digital resources, saves a large amount of work.
    Type: Application
    Filed: December 2, 2013
    Publication date: March 5, 2015
    Applicants: PEKING UNIVERSITY FOUNDER GROUP CO., LTD., BEIJING FOUNDER ELECTRONICS CO., LTD., FOUNDER INFORMATION INDUSTRY GROUP
    Inventors: Huarui BAI, Changgang CHEN
  • Patent number: 8972423
    Abstract: A system, method, and computer program for parsing a schema across a system to support interoperable machine-to-machine interaction over a network, comprising the steps of communicating a plurality of data in a data defining mark-up language file by a transport protocol stack; parsing said data defining mark-up language to determine at least one opaque schema element; and translating said at least one opaque schema element to a mark-up language string element and appropriate means and computer-readable instructions.
    Type: Grant
    Filed: September 26, 2006
    Date of Patent: March 3, 2015
    Assignee: Siemens Product Lifecycle Management Software Inc.
    Inventors: Puneet Vardhan, Ronald Marchi
  • Patent number: 8949286
    Abstract: When using virtually stored data sets, such as virtual storage access method (VSAM) data sets, while the data set is open (referred to as an open time) static data set characteristics and/or job parameters have been defined for the VSAM data set. In one approach, a method for modifying a virtual storage access method (VSAM) data set includes opening a VSAM data set; and modifying a VSAM control block structure for the VSAM data set.
    Type: Grant
    Filed: February 14, 2013
    Date of Patent: February 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Kam H. Ho, Maya P. Pandya
  • Patent number: 8943100
    Abstract: In a method for storing data in a relational database system using a processor, a collection of values is assigned to a structure dictionary, each of the values represents the value of a row for an attribute and has a unique ordinal number within the collection. and the structure dictionary contains structures defined based on at least one of interaction with a user of the system via an interface, automatic detection of structures occurring in data, automatic detection of frequencies of values occurring in data, analysis of a history of queries, and predetermined information about structures relevant to data content that is stored in the system. For each structure, forming a structure match list from ordinal numbers of values matching the structure, and a structure sub-collection from values matching the structure, using the processor.
    Type: Grant
    Filed: March 13, 2013
    Date of Patent: January 27, 2015
    Assignee: Infobright Inc.
    Inventors: Dominik Slezak, Graham Toppin, Marcin Kowalski, Arkadiusz Wojna
  • Patent number: 8943104
    Abstract: When using virtually stored data sets, such as virtual storage access method (VSAM) data sets, while the data set is open (referred to as an open time) static data set characteristics and/or job parameters have been defined for the VSAM data set. In one approach, even after a data set is opened, a virtually stored control block structure for the data set may be modified, such as by providing a dynamic address space associated with the data set in order to interact with the data set in an environment which allows for a service block request to modify the control block structure, such that data set characteristics and/or job parameters for the data set may be modified during the open time of the data set.
    Type: Grant
    Filed: August 30, 2011
    Date of Patent: January 27, 2015
    Assignee: International Business Machines Corporation
    Inventors: Kam H. Ho, Maya P. Pandya
  • Patent number: 8938522
    Abstract: A content management system (CMS) includes a value-add application with a first set of XML content services, one or more dedicated XML processing servers with a second and other sets of XML content services, and a core CMS with a third set of XML content services. The content management system may be designed to provide XML content services at any of these three tiers of processing. A first threshold is defined that allows the value-add application to determine when to offload XML content services to a dedicated XML processing server. A second threshold is defined that allows the core CMS to determine when to offload XML content services to a dedicated XML processing server. Callback services are included that allow each tier of XML content services to send or receive additional information to complete the XML processing. The result is a content management system that is very powerful and flexible.
    Type: Grant
    Filed: January 30, 2013
    Date of Patent: January 20, 2015
    Assignee: International Business Machines Corporation
    Inventors: David G. Herbeck, John E. Petri
  • Patent number: 8930380
    Abstract: Automatically generating a parser is disclosed. Raw data is received from a first remote device. A determination that the raw data does not, within a predefined confidence measure, conform to any rules included in a set of rules is made. A clustering function is performed on the raw data. At least one parser rule is generated based on the clustering.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: January 6, 2015
    Assignee: Sumo Logic
    Inventors: Kumar Saurabh, Christian Friedrich Beedgen, Bruno Kurtic
  • Patent number: 8924436
    Abstract: Methods, systems, and apparatus, including computer programs stored on computer storage media, for populating a structured presentation with new values. One aspect can be embodied in machine-implemented methods that include the actions of obtaining a plurality of instances and a plurality of attributes; for each instance: identifying one or more documents from an unstructured document collection that are relevant to the instance, where each of the one or more documents include at least a value for an attribute in the plurality of attributes; and establishing a subset of the one or more values as characterizing the instance; and adding each instance, the respective attributes, and the respective subset of values to a structured data collection.
    Type: Grant
    Filed: April 1, 2013
    Date of Patent: December 30, 2014
    Assignee: Google Inc.
    Inventors: Vinicius J. Fortuna, Andriy Bihun, Leonardo A. Laroco, Jr., Daniel Loreto, Elena Erbiceanu, Jeffrey C. Reynar, Andrew William Hogue, Ankur Bhargava
  • Publication number: 20140379761
    Abstract: A method for content extraction and modeling by a computer system for incorporating the content into a domain model including the steps of extracting by an acquisition module content stored on a computer readable medium of at least one data source, determining whether the content is structured or unstructured; wherein structured content has a first content model associated therewith defining at least a format of the structured content and unstructured content has no model associated therewith. Upon a condition in which the content is structured, incorporating the structured content into the domain model; and, upon a condition in which the content is unstructured, determining by the computer system a second content model to transform the unstructured content into newly structured content and incorporating the newly structured content into the domain model.
    Type: Application
    Filed: June 25, 2013
    Publication date: December 25, 2014
    Inventors: Daniel Ryan ADAMSON, Victor Fernandovich Comas LIJACHEV
  • Publication number: 20140358975
    Abstract: A server includes a data ingestion module with executable instructions executed by a processor to produce inferred data types from received data and utilize the inferred data types to augment the received data with computed values that aggregate the received data along a hierarchical dimension.
    Type: Application
    Filed: May 30, 2014
    Publication date: December 4, 2014
    Applicant: ClearStory Data Inc.
    Inventors: Vaibhav Nivargi, Nicholas Briggs, Douglas van der Molen, Kevin Malone, Mikhail Bautin
  • Publication number: 20140337390
    Abstract: Example methods, apparatuses, or articles of manufacture are disclosed that may be implemented using one or more computing devices or platforms to facilitate or otherwise support one or more processes or operations associated with a space-time-node engine signal structure.
    Type: Application
    Filed: June 3, 2014
    Publication date: November 13, 2014
    Applicant: Space-Time Insight
    Inventor: Krishna Kumar
  • Patent number: 8886679
    Abstract: Systems and methods of mining Web applications are disclosed. An example method may include selecting a distance measure between subtrees of the Web applications. The method may also include computing an edit distance for the subtrees. The method may also include finding sets of similar subtrees using the edit distance with a Similar Sibling-Trees-aware RTDM (SiSTeR) approach.
    Type: Grant
    Filed: October 11, 2011
    Date of Patent: November 11, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Omer Barkol, Ruth Bergman, Shahar Golan
  • Patent number: 8886580
    Abstract: Aggregate scoring is used to help classify digital content such as content uploaded to multi-user websites (e.g., social networking websites). In one embodiment, specific categories are used that relate to a social implication of content. For example, text, images, audio or other data formats can provide communication perceived to fall into categories such as violent, abusive, rights management, pornographic or other types of communication. The categories are used to provide a raw score to items in various groupings of a site's content. Where items are related to other items such as by organizational, social, legal, data-driven, design methods, or by other principles or definitions, the related items' raw scores are aggregated to achieve a score for a particular grouping of items that reflects, at least in part, scores from two or more of the related items.
    Type: Grant
    Filed: February 28, 2013
    Date of Patent: November 11, 2014
    Assignee: Lithium Technologies, Inc.
    Inventors: Pierre Grenier, Eddie Lo, Satinderpal Sandhu
  • Patent number: 8880732
    Abstract: Method and system for application isolation is provided. A computing system executes a plurality of processor executable applications and each application is assigned an address by an adapter that is operationally coupled to the computing system. The adapter also assigns a unique identifier to each construct generated by the plurality of applications; maintains a data structure associating the address with the unique identifier assigned to each construct; receives a request having an address and a unique identifier from an application from among the plurality of applications; retrieves the address and the unique identifier from the request; compares the retrieved address and the unique identifier with information maintained by the data structure; and processes the request if the retrieved address and the unique identifier matches with information of the data structure.
    Type: Grant
    Filed: February 25, 2011
    Date of Patent: November 4, 2014
    Assignee: QLOGIC, Corporation
    Inventor: Kanoj Sarcar
  • Publication number: 20140317155
    Abstract: A data source, such as a web page, a locally retrieved document, user-entered information, etc., is made visible to a user via a display, such as a computer monitor or touch-screen tablet or smart phone screen. A data capture window, which may be in the form of a data grid, is also displayed to the user, who can select data items from the data source such that they are represented in the data grid. Some data items may also be identified and collected automatically. Data collected into the capture window is then associated with corresponding portions of records in a data base.
    Type: Application
    Filed: May 15, 2014
    Publication date: October 23, 2014
    Applicant: SEARCHISTICS LLC
    Inventors: VARDA TREIBACH-HECK, IAN D. HART
  • Patent number: 8868621
    Abstract: The Computer-implemented system, method or computer program that creates a data table of rows and columns from an HTML Web page or document independent of the HTML markup tags. Data embedded in the HTML is identified using clustering of text and extracted into a data table. The generation of data tables can be performed in real-time and is not subject to problems with malformed or poorly created HTML.
    Type: Grant
    Filed: October 20, 2011
    Date of Patent: October 21, 2014
    Assignee: Rillip, Inc.
    Inventors: Ray J. D'Onofrio, II, Ray J. D'Onofrio
  • Patent number: 8862636
    Abstract: A computer-implemented method for use with an extensible markup language (XML) document includes inputting a high-level mapping specification for a schema mapping; and generating a target XML document based on the mapping. The method may perform schema mapping-based XML transformation as a three-phase process comprising tuple extraction, XML-fragment generation, and data merging. The tuple extraction phase may be adapted to handle streamed XML data (as well as stored/indexed XML data). The data merging phase may use a hybrid method that can dynamically switch between main memory-based and disk-based algorithms based on the size of the XML data to be merged.
    Type: Grant
    Filed: June 5, 2008
    Date of Patent: October 14, 2014
    Assignee: International Business Machines Corporation
    Inventors: Wook-Shin Han, Ching-Tien Ho, Haifeng Jiang, Lucian Popa
  • Patent number: 8856186
    Abstract: Methods, systems, and computer program products for memory garbage collection via object grouping are disclosed. Normative objects are allocated in a non-native memory referencing respective one or more native objects in a native memory. The non-native objects are grouped into non-native object groups based upon information regarding object references associated with respective ones of the native objects. Objects in respective ones of the one or more non-native object groups are provided one or more references to other said objects of the non-native object groups to create connected object groups. Unused ones of the connected object groups are detected. And objects of the detected unused ones of the connected object groups are de-allocated.
    Type: Grant
    Filed: June 29, 2011
    Date of Patent: October 7, 2014
    Assignee: Google Inc.
    Inventors: Peng Li, Dominic Cooney
  • Patent number: 8825716
    Abstract: Business intelligence, decision support and knowledge management network systems and methods provide efficient, robust, and business-friendly services for the rapid analysis of massive amounts of business, electronic, and other disparate data into actionable intelligence. An advantageous element of the multi-party knowledge network allows the aggregation of common data formats in order to analyze a combined dataset consisting of information from multiple parties, thereby providing additional business intelligence than with a single set of data alone.
    Type: Grant
    Filed: March 6, 2014
    Date of Patent: September 2, 2014
    Assignee: Cross Commerce Media, Inc.
    Inventors: Russell William Martin, Jr., Michael Martinov, Heidi Messer, Stephen Messer
  • Patent number: 8819087
    Abstract: Systems and methods for assisting one or more users in the production of a media work from a plurality of recordings are disclosed. In an exemplary embodiment, digital versions of the recordings are stored on a file-storage server accessible over a communications network. A media-library server provides an organized way for users to access and view the digital versions, and to associate various data files with the digital versions, such as transcription files, logging files, and storyboard documents. The exemplary media library generates web pages that enable users to search and access the digital versions from an internet browser running on a conventional personal computer. The browser also enables users to view the digital versions with a browser-compatible media viewer.
    Type: Grant
    Filed: September 5, 2012
    Date of Patent: August 26, 2014
    Assignee: Thomas Majchrowski & Associates, Inc.
    Inventor: Keri DeWitt
  • Patent number: 8812562
    Abstract: Embodiments described herein provide numerous applications and implementations of a social network to facilitate individuals to resolve various life issues. These issues may include issues that arise when individuals or families relocate, including logistic problems, assimilation of family members in a community, and roommate pairings. As will be described, embodiments described herein greatly facilitate corporations in relocating their employees logistically, and also assist employees and their families with life issues that may determine whether the employees' relocation will be a success.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: August 19, 2014
    Inventor: Emily White
  • Patent number: 8812643
    Abstract: Encoding and/or decoding of messages. On the encoding end, a composite encoder encodes message from an internal format that is used by internal system components into an external format. However, the composite encoder may encode the outgoing messages into different external formats on a per-message basis. For incoming message, a composite decoder decodes incoming messages from any one of a plurality of external formats into the internal format also on a per-message basis. A per-message report mechanism permits internal system components and the encoding/decoding components to communicate information regarding the encoding or decoding on a per message basis. XML messages can be converted automatically into Binary messages.
    Type: Grant
    Filed: July 5, 2011
    Date of Patent: August 19, 2014
    Assignee: Microsoft Corporation
    Inventors: Stephen Jared Maine, Michael J. Coulson, Tirunelveli R. Vishwanath, Erik B. Christensen
  • Patent number: 8782101
    Abstract: Methods and systems for transferring data across incompatible database platforms are provided. A method may include reading data from a database export file containing a representation of objects and data from a source database of a first database platform, where the file was generated by an export utility of the first database platform, has a format specific to the first database platform, and is stored outside of the source database. The data read from the file is parsed based on the format. At least one database table represented in the file is identified based on the parsing. Row-level data associated with the at least one database table is also identified based on the parsing. The row-level data is collected in a buffer and is then inserted into a destination database of a second database platform using at least one dedicated data insertion thread.
    Type: Grant
    Filed: January 20, 2012
    Date of Patent: July 15, 2014
    Assignee: Google Inc.
    Inventor: Matthew Aaron Moore
  • Patent number: 8768928
    Abstract: DOM based unique ID generation, including receiving a hypertext markup language (HTML) page at a computer, and identifying HTML page elements in response to the receiving, the HTML page elements comprising parent nodes, the parent nodes comprising child nodes. The method further comprising processing each of the HTML page elements, the processing comprising: grouping the child nodes by parent node into a group of child nodes, detecting patterns in the group of child nodes in response to the grouping, reducing the group of child nodes to text strings in response to the detecting, storing the text strings as text values in the parent nodes, and generating a unique identifier (ID) of the HTML page in response to the processing.
    Type: Grant
    Filed: March 5, 2012
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Khalil Ayoub, Hosam Aly, Jason Walsh
  • Patent number: 8762829
    Abstract: A computer-implemented method to determine a robust wrapper includes developing a model indicative of the temporal history of a document, such as a web document written in a markup language. Based on the developed model, robustness characteristics are determined for a plurality of different wrappers representing associated paths to the data item in a representation of the document. Based on a result of the determining operation, a result wrapper of the plurality of wrappers is provided. The result wrapper has a desired robustness characteristic.
    Type: Grant
    Filed: December 24, 2008
    Date of Patent: June 24, 2014
    Assignee: Yahoo! Inc.
    Inventors: Nilesh Dalvi, Philip Bohannon, Fei Sha