From Unstructured Or Semi-structured Data To Structured Data Patents (Class 707/811)
-
Patent number: 12164869Abstract: An apparatus comprises a processing device configured to generate and utilize an automated document parser for parsing documents exchanged as part of fulfillment of historical component orders by a given supplier in a supply chain for first and second organizations to identify first and second organization-specific component identifiers utilized by the first and second organizations and a given supplier-specific component identifier utilized by the given supplier for a given component.Type: GrantFiled: April 22, 2022Date of Patent: December 10, 2024Assignee: Dell Products L.P.Inventors: Shibi Panikkar, Rohit Gosain
-
Patent number: 12079243Abstract: Various methods, apparatuses/systems, and media for data transmission are disclosed. A processor accesses a database that stores data relating to one or more transactions; implements a replication tool that is configured for a table within the table. When a row is added to the table or modified in the table, the replication tool identifies the added or modified row. The processor publishes, by utilizing the replication tool, the data associated with the added or modified row onto a local message queue (MQ); reads the published data from the local MQ; converts the data into a configuration file having a predefined file format; parses the data from the configuration file; creates a predefined payload based on the parsed data; and transmits the predefined payload onto a log database via a data bus.Type: GrantFiled: December 14, 2021Date of Patent: September 3, 2024Assignee: JPMORGAN CHASE BANK, N.A.Inventors: Praveen Bangera, Harishkumar Lakshminarasimhamurthy, Suhas Mathakari, Ramgopal Reddy Gonnuru
-
Patent number: 11797526Abstract: It may be desired to query a large record of events to retrieve certain segments of the data that provide information of interest. However, the data might not be structured such that it is conducive to generate/execute queries that inquire about net changes of events over time. Query language may also be interpreted differently by different analysts. In some embodiments data related to events is assembled into a data structure having a format logically equivalent to a double entry transition table. Each event may have two entries in the table: a first entry representing the event as a transition from a previous state to a current state, and a second entry representing the reverse transition of the first entry. In some embodiments, a query executor tool may be provided as a graphical user interface for entering information of interest to be queried.Type: GrantFiled: September 16, 2021Date of Patent: October 24, 2023Assignee: SHOPIFY INC.Inventor: Benjamin Alfred Lucas Doyle
-
Patent number: 11645464Abstract: Systems, computer-implemented methods, and computer program products to transform a lexicon that describes an information asset are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a term validation component that can determine from a subject matter expert, a validated term that can indicate validation of a candidate term that describes an information asset. The computer executable components can further comprise a lexicon transforming component that, based on the validated term, can transform a lexicon that describes the information asset, by incorporating the validated term into the lexicon.Type: GrantFiled: March 18, 2021Date of Patent: May 9, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Anna Lisa Gentile, Chad Eric DeLuca, Petar Ristoski, Ismini Lourentzou, Linda Ha Kato, Alfredo Alba, Daniel Gruhl, Steven R. Welch
-
Patent number: 11538594Abstract: Implementations provide a method to consolidate data records of regimens for treating oncology conditions. The method includes: accessing data records each encoding multi-tier data characteristics of a regimen for treating a particular oncology condition; receiving a first data record encoding a first regimen specific to a first healthcare provider institution; parsing the first data record according to a hierarchy of the encoded multi-tier data characteristics; distributing a respective weight to each of the encoded data characteristics to account for the potentially missing data characteristic; comparing data characteristics of the first data record with data characteristics from the data records by applying the respective weight to each data characteristic at a particular tier of the hierarchy such that a respective compound score is generated for each data record; and based on the compound scores for all data records, determining a prevailing data record of regimen as matching the first data record.Type: GrantFiled: October 19, 2020Date of Patent: December 27, 2022Assignee: IQVIA Inc.Inventors: Sanam Sikander, Edmund Drage
-
Patent number: 11494786Abstract: A third party item listing management system usable for verification of third party items to be included on a retailer website includes an application programming interface and an item verification pipeline. The application programming interface is accessible by a plurality of third parties and is configured to receive item data associated with one or more items. The item verification pipeline is configured to receive the item data and call an item validation pipeline, the item validation pipeline includes a plurality of item validation stages including a field verification. At the field verification stage, data is extracted from at least one of an item image or text associated with the item. The data extracted is compared to item data obtained from an independent verification source to confirm the accuracy of the item data provided by the third party.Type: GrantFiled: February 7, 2020Date of Patent: November 8, 2022Assignee: Target Brands, Inc.Inventors: Dakota Reese Brown, Jorge A. Trujillo, Andrew Boerema
-
Patent number: 11416466Abstract: Disclosed are system and methods for processing and storing data files, using a data edge file format. The data edge file separates information about what symbols are in a data file and information about the corresponding location of those symbols in the data file. The described technique for converting a source file comprising symbols into a data edge file includes: generating a locality file of symbol location from the source file to identify locations of the symbols in the source file, generating a symbol file to identify symbols in the source file, and then modifying the locality file of symbol location to associate each symbol from the symbol file with a location in the source file.Type: GrantFiled: June 1, 2018Date of Patent: August 16, 2022Assignee: CHAOSSEARCH, INC.Inventors: Thomas Hazel, David Noblet, Eric Mann, Grant Mills
-
Patent number: 11416291Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: GrantFiled: July 12, 2021Date of Patent: August 16, 2022Assignee: Metacluster LT, UABInventors: Eivydas Vilcinskas, Arnas Petruskevicius, Giedrius Stalioraitis, Martynas Juravicius, Rimantas Stankevicius
-
Patent number: 11386063Abstract: Disclosed are system and methods for processing and storing data files, using a data edge file format. The data edge file separates information about what symbols are in a data file and information about the corresponding location of those symbols in the data file. The described technique for converting a source file comprising symbols into a data edge file includes: generating a locality file of symbol location from the source file to identify locations of the symbols in the source file, generating a symbol file to identify symbols in the source file, and then modifying the locality file of symbol location to associate each symbol from the symbol file with a location in the source file.Type: GrantFiled: May 20, 2021Date of Patent: July 12, 2022Assignee: CHAOSSEARCH, INC.Inventors: Thomas Hazel, David Noblet, Eric Mann, Grant Mills
-
Patent number: 11314930Abstract: Implementations described herein determine, for a given document generated by a given source, one or more portions of content (e.g., phrase(s), image(s), paragraph(s), etc.) of the given document that may be influenced by a source perspective of the given source. Further, implementations determine one or more additional resources that are related to the given source and that are related to the portion(s) of content of the given document. Yet further, implementations utilize the additional resource(s) to determine additional content that provides context for the portion(s) that may be influenced by a source perspective. A relationship, between the additional resource(s) and the portions of the given document, can be defined. Based on the relationship being defined, the additional content can be caused to be rendered at a client device in response to the client device accessing the given document.Type: GrantFiled: December 30, 2019Date of Patent: April 26, 2022Assignee: GOOGLE LLCInventors: Victor Carbune, Thomas Deselaers
-
Patent number: 11237752Abstract: An embodiment includes locating source data associated with a data replication request, and identifying a property of a data value of the source data using a reflection process. The embodiment determines a meta-type for the data value using an automaton that identifies a matching extent class from among an indexed set of extent classes that define respective accessors that are invoked to locate an extent class that corresponds with the property of the data value. The embodiment generates target data having the data value and a data type that corresponds with the matching extent class. The embodiment generates a changes feed that includes the generated target data, evaluates data in the changes feed to determine whether a specified condition has been satisfied and, if so, generates a trigger signal that initiates an event, where the trigger signal comprising information that satisfied the specified condition.Type: GrantFiled: April 21, 2020Date of Patent: February 1, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Dirk Alexander Seelemann, II, Michael Jory
-
Patent number: 11232365Abstract: A digital assistant platform may receive a query relating to a target data file. The target data file may be associated with a standardized data set. The standardized data set may include data files that are aggregated based on data elements associated with the data files, and may be configured to enable a plurality of services. The digital assistant platform may extract, based on the query, one or more keywords, and identify an intent classification and an entity associated with the query. The digital assistant platform may analyze the plurality of services to identify a target service to enable, and analyze the data elements to identify the target data file. The digital assistant platform may determine, using the target service, analytical information associated with a data element of the target data file, and generate a response to the query based on the analytical information.Type: GrantFiled: October 22, 2019Date of Patent: January 25, 2022Assignee: Accenture Global Solutions LimitedInventors: Arun Sundararaman, Uday Kumar Ramamoorthy, Sureshkumar Pargunarajan, Sangeetha Appusamy, A. Deni Xavier Gladis
-
Patent number: 11210266Abstract: Systems and methods for collection and management of metadata are disclosed. The methods include maintaining a plurality of data storage systems in communication with an external metadata management system, and operating the metadata management system to store metadata corresponding to data residing on the plurality of data storage systems as a plurality of entries. Each of the plurality of entries include one or more facets of data corresponding to metadata stored in that entry and extracted using text analysis of that metadata.Type: GrantFiled: January 25, 2019Date of Patent: December 28, 2021Assignee: International Business Machines CorporationInventors: Joseph Dain, Gregory Kishi, Ron Riffe
-
Patent number: 11194828Abstract: Disclosed is a system, method, and computer program product for implementing a log analytics method and system that can configure, collect, and analyze log records in an efficient manner. An improved approach has been described to automatically generate a log parser by analysis of the line content of a log. In addition, an efficient approach has been described to extract key-value content from the log content.Type: GrantFiled: April 15, 2019Date of Patent: December 7, 2021Assignee: Oracle International CorporationInventor: Gregory Michael Ferrar
-
Patent number: 10719799Abstract: A computer-implemented method and system are provided for optimizing resource usage, wherein the resources include employees of an organization. The method includes collecting employee data including structured data and unstructured data through multiple input channels over at least one network and storing the employee data collected over the multiple input channels in at least one computer memory. The method further includes accessing the computer memory using at least one computer processor and executing instructions to perform multiple operations on the stored data. The operations include transforming the unstructured data into structured data and disambiguating the structured data. The operations additionally include applying rule sets to the transformed data and the structured data to derive a digital productivity footprint for each employee and analyzing the derived digital footprints to optimize resource usage.Type: GrantFiled: March 13, 2014Date of Patent: July 21, 2020Assignee: JPMORGAN CHASE BANK, N.A.Inventors: William V. Harris, Jonathan B. Teplitz, Ganesh Murugan
-
Patent number: 10585926Abstract: Embodiments include method, systems and computer program products for managing structuring of large sets of unstructured data. In some embodiments, a search query may be received from a user via a graphical user interface (GUI). The search query may be parsed to identify a data aspect and a first value. An aspect-value pair may be generated using the data aspect and the first value. A data asset may be generated by associating a type structure to the unstructured data comprising a second value, wherein the type structure comprises the data aspect and the second value. A set of search results may be generated using the first value, wherein the set of search results comprises at least one data asset that matches the first value. Presentation of the set of search results may be facilitated, where the set of search results corresponds to the search query and comprises the data aspect.Type: GrantFiled: June 14, 2016Date of Patent: March 10, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Paul K. Bullis, Geoffrey M. Hambrick
-
Patent number: 10521655Abstract: Implementations described herein determine, for a given document generated by a given source, one or more portions of content (e.g., phrase(s), image(s), paragraph(s), etc.) of the given document that may be influenced by bias of the given source. Further, implementations determine one or more additional resources that are related to the given source and that are related to the portion(s) of content of the given document. Yet further, implementations utilize the additional resource(s) to determine additional content that provides context for the portion(s) that may be influenced by bias. A relationship, between the additional resource(s) and the portions of the given document, can be defined. Based on the relationship being defined, the additional content can be caused to be rendered at a client device in response to the client device accessing the given document.Type: GrantFiled: February 11, 2019Date of Patent: December 31, 2019Assignee: GOOGLE LLCInventors: Victor Carbune, Thomas Deselaers
-
Patent number: 10185755Abstract: In one embodiment, a method receives a query for data in a database system and calls a plurality of engines to analyze information for the query. A calculation graph is generated from at least a portion of the plurality of engines where each of the at least a portion of the plurality of engines add a node to the calculation graph based on analyzing of the information for the query. Then, the method executes the query by calling the nodes of the calculation graph. Each node uses metadata added to the node from a respective calculation engine to perform a calculation for the node. Then, a result of the query is output based on the respective calculations performed by the nodes.Type: GrantFiled: December 28, 2015Date of Patent: January 22, 2019Assignee: Business Objects Software LimitedInventors: Saeed Jahankhani, David Mosimann
-
Patent number: 10185557Abstract: A change management system generates change records corresponding to changes to tracked documents, and stores a master control file comprising metadata records that respectively correspond to the tracked documents and which comprise a tracked metadata field and a time field. Responsive to detecting that a change to a given document is of a predefined type, the tracked metadata field in the corresponding metadata record is modified, and the time field in that metadata record is updated accordingly. In response to a problem event, it is determined that the problem event relates to the given document, and, based on the time field, a subset of the change records is selected. Each change record in the subset corresponds to the given document. A problem change record is identified from the change records in the subset, and used to revert the given document to a state previous to the problem event.Type: GrantFiled: October 29, 2015Date of Patent: January 22, 2019Assignee: CA, Inc.Inventors: Edward Alan Goldstein, Arthur Vincent Richards
-
Patent number: 10146863Abstract: Item classification rules are created based on examples selected by a user, such as by selecting a subset of emails, and the rule is used across a larger set of items to obtain automatic classification of similar items according to the rule. Based on an analysis, a candidate classification rule is generated identifying text-based features shared among the items of the subset. The user can review the candidate rule as well as a resultant subset of items generated by the rule, and either accept the candidate rule or make an adjustment to the examples and then perform one or more iterations of the analysis to refine the rule. Adjustments can be made by removing items incorrectly included in a resultant subset and/or adding items incorrectly excluded from a resultant subset, and using the adjusted subset in a next iteration.Type: GrantFiled: November 17, 2014Date of Patent: December 4, 2018Assignee: GetGo, Inc.Inventors: Ashish V. Thapliyal, Anne Marie Lock, Elizabeth Thapliyal, Ryan W. Kasper, Stefan Alexander von Imhof
-
Patent number: 10067992Abstract: A method and system are provided for interactive data visualization. The method includes analyzing a data source used with an initial query to identify a set of default categories available for user selection. The method further includes dynamically determining pre-set values for categorical portions and numerical portions of vocabularies for user selection using data analytics on the data source. The method also includes providing the user with a capability to specify other values for the portions. The method additionally includes dynamically generating multiple sub-queries to the data source for the values for the portions, wherein at least one of the multiple sub-queries is dynamically generated for at least one of the other values specified by the user. The method further includes combining result sets for the multiple sub-queries. The method also includes generating a data visualization of the combined result sets and displaying the data visualization.Type: GrantFiled: July 13, 2016Date of Patent: September 4, 2018Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Behrooz Ashtiani, Mitra Khatibi, Diana Lau, Jin Li, Joanna W. Ng
-
Patent number: 9460188Abstract: A compatibility processing module, for executing one or more processes to format and manipulate data, such that communication between previously-incompatible data warehouses is facilitated. In particular, a first warehouse is disclosed, wherein the first data warehouse is configured with a compatibility processing module, for receiving a large number of data points, and for executing one or more processes on a stored portion of the received data points such that the resulting processed data points are compatible with formatting conventions of a second data warehouse.Type: GrantFiled: June 3, 2013Date of Patent: October 4, 2016Assignee: Bank of America CorporationInventors: Bharath Mundlapudi, Karthik Banala, Rajesh Koneru
-
Patent number: 9223817Abstract: Virtual repository management is disclosed. An indication is received that a content management functionality is desired to be available with respect to one or more external content items. A reference object is created for each for the one or more external content items that represents the external content item and enables the content management functionality to be performed with respect to the external content item.Type: GrantFiled: May 20, 2014Date of Patent: December 29, 2015Assignee: EMC CorporationInventors: Razmik Abnous, Victor Spivak, Eric Merhoff
-
Patent number: 9213733Abstract: The present invention provides a system and method that can search social media and Internet websites, and can analyze and display the results according to a variety of criteria including virality on social media websites. The results are presented in a user friendly format such as a magazine, newsletter, newspaper, or scrapbook.Type: GrantFiled: April 8, 2015Date of Patent: December 15, 2015Inventors: Luis Sanchez, Ralf Voellmer
-
Patent number: 9009201Abstract: Methods, program products, and systems implementing integrated repository of structured and unstructured data are disclosed. An integrated environment is provided for accessing structured data (e.g., data of a relational database) and unstructured data (e.g., data stored in a text or binary file), including creating, managing, modifying, and searching the structured data and unstructured data. The integrated environment can include an integrated user interface, a set of commands and application programming interface (API), and storage for a relational database and a document repository. The integrated environment can include a database abstraction layer that allows database operations on both the structured data and the unstructured data.Type: GrantFiled: May 13, 2013Date of Patent: April 14, 2015Assignee: Apple Inc.Inventors: Eric Jacobson, Les Thomas, Gang Zhang
-
Patent number: 8996587Abstract: Techniques are provided for automatically structuring free form heterogeneous data. In one aspect of the invention, the techniques include obtaining free form heterogeneous data, segmenting the free form heterogeneous data into one or more units, automatically labeling the one or more units based on one or more machine learning techniques, wherein each unit is associated with a label indicating an information type, and structuring the one or more labeled units in a format to facilitate one or more operations that use at least a portion of the labeled units, e.g., information technology (IT) operations.Type: GrantFiled: February 15, 2007Date of Patent: March 31, 2015Assignee: International Business Machines CorporationInventors: Gautam Kar, Ruchi Mahindru, Anca Sailer, Xing Wei
-
Patent number: 8977656Abstract: A data structure comprises a clump header table and an inline tree data structure. The inline tree, representing filterable data fields of hierarchically organized data records, comprises an alternating sequence of first-level binary string segments, each followed by one or more corresponding second-level binary string segments. Each clump header record includes an indicator of a location in the inline tree of corresponding binary string segments. A dedicated, specifically adapted conversion program generates the clump header file and the inline tree for storage on any computer-readable medium, and the inline tree can be read entirely into RAM to be searched or filtered. A dedicated, specifically adapted search and filter program is employed to list or enumerate retrieved data records. Run-time computer code generation can reduce time required for searching and filtering. One example includes spatial searching and filtering of data records that include spatial coordinates as data fields.Type: GrantFiled: January 10, 2012Date of Patent: March 10, 2015Assignee: Moonshadow Mobile, Inc.Inventor: Roy W. Ward
-
Publication number: 20150066996Abstract: The present invention discloses a method and a system for automatically collecting the publication digital resource, the method comprises: acquiring the resource document in a digital resource of a publication; identifying the resource document according to a preset identifying rule, obtaining an identified result, the identified result comprises: a document type, a document relation and sequencing; uploading the resource document to a server; generating property information of the resource document according to the identified result; storing the property information to a database. The present invention increases the efficiency of collecting the publication digital resources, saves a large amount of work.Type: ApplicationFiled: December 2, 2013Publication date: March 5, 2015Applicants: PEKING UNIVERSITY FOUNDER GROUP CO., LTD., BEIJING FOUNDER ELECTRONICS CO., LTD., FOUNDER INFORMATION INDUSTRY GROUPInventors: Huarui BAI, Changgang CHEN
-
Patent number: 8972423Abstract: A system, method, and computer program for parsing a schema across a system to support interoperable machine-to-machine interaction over a network, comprising the steps of communicating a plurality of data in a data defining mark-up language file by a transport protocol stack; parsing said data defining mark-up language to determine at least one opaque schema element; and translating said at least one opaque schema element to a mark-up language string element and appropriate means and computer-readable instructions.Type: GrantFiled: September 26, 2006Date of Patent: March 3, 2015Assignee: Siemens Product Lifecycle Management Software Inc.Inventors: Puneet Vardhan, Ronald Marchi
-
Patent number: 8949286Abstract: When using virtually stored data sets, such as virtual storage access method (VSAM) data sets, while the data set is open (referred to as an open time) static data set characteristics and/or job parameters have been defined for the VSAM data set. In one approach, a method for modifying a virtual storage access method (VSAM) data set includes opening a VSAM data set; and modifying a VSAM control block structure for the VSAM data set.Type: GrantFiled: February 14, 2013Date of Patent: February 3, 2015Assignee: International Business Machines CorporationInventors: Kam H. Ho, Maya P. Pandya
-
Patent number: 8943100Abstract: In a method for storing data in a relational database system using a processor, a collection of values is assigned to a structure dictionary, each of the values represents the value of a row for an attribute and has a unique ordinal number within the collection. and the structure dictionary contains structures defined based on at least one of interaction with a user of the system via an interface, automatic detection of structures occurring in data, automatic detection of frequencies of values occurring in data, analysis of a history of queries, and predetermined information about structures relevant to data content that is stored in the system. For each structure, forming a structure match list from ordinal numbers of values matching the structure, and a structure sub-collection from values matching the structure, using the processor.Type: GrantFiled: March 13, 2013Date of Patent: January 27, 2015Assignee: Infobright Inc.Inventors: Dominik Slezak, Graham Toppin, Marcin Kowalski, Arkadiusz Wojna
-
Patent number: 8943104Abstract: When using virtually stored data sets, such as virtual storage access method (VSAM) data sets, while the data set is open (referred to as an open time) static data set characteristics and/or job parameters have been defined for the VSAM data set. In one approach, even after a data set is opened, a virtually stored control block structure for the data set may be modified, such as by providing a dynamic address space associated with the data set in order to interact with the data set in an environment which allows for a service block request to modify the control block structure, such that data set characteristics and/or job parameters for the data set may be modified during the open time of the data set.Type: GrantFiled: August 30, 2011Date of Patent: January 27, 2015Assignee: International Business Machines CorporationInventors: Kam H. Ho, Maya P. Pandya
-
Patent number: 8938522Abstract: A content management system (CMS) includes a value-add application with a first set of XML content services, one or more dedicated XML processing servers with a second and other sets of XML content services, and a core CMS with a third set of XML content services. The content management system may be designed to provide XML content services at any of these three tiers of processing. A first threshold is defined that allows the value-add application to determine when to offload XML content services to a dedicated XML processing server. A second threshold is defined that allows the core CMS to determine when to offload XML content services to a dedicated XML processing server. Callback services are included that allow each tier of XML content services to send or receive additional information to complete the XML processing. The result is a content management system that is very powerful and flexible.Type: GrantFiled: January 30, 2013Date of Patent: January 20, 2015Assignee: International Business Machines CorporationInventors: David G. Herbeck, John E. Petri
-
Patent number: 8930380Abstract: Automatically generating a parser is disclosed. Raw data is received from a first remote device. A determination that the raw data does not, within a predefined confidence measure, conform to any rules included in a set of rules is made. A clustering function is performed on the raw data. At least one parser rule is generated based on the clustering.Type: GrantFiled: June 30, 2011Date of Patent: January 6, 2015Assignee: Sumo LogicInventors: Kumar Saurabh, Christian Friedrich Beedgen, Bruno Kurtic
-
Patent number: 8924436Abstract: Methods, systems, and apparatus, including computer programs stored on computer storage media, for populating a structured presentation with new values. One aspect can be embodied in machine-implemented methods that include the actions of obtaining a plurality of instances and a plurality of attributes; for each instance: identifying one or more documents from an unstructured document collection that are relevant to the instance, where each of the one or more documents include at least a value for an attribute in the plurality of attributes; and establishing a subset of the one or more values as characterizing the instance; and adding each instance, the respective attributes, and the respective subset of values to a structured data collection.Type: GrantFiled: April 1, 2013Date of Patent: December 30, 2014Assignee: Google Inc.Inventors: Vinicius J. Fortuna, Andriy Bihun, Leonardo A. Laroco, Jr., Daniel Loreto, Elena Erbiceanu, Jeffrey C. Reynar, Andrew William Hogue, Ankur Bhargava
-
Publication number: 20140379761Abstract: A method for content extraction and modeling by a computer system for incorporating the content into a domain model including the steps of extracting by an acquisition module content stored on a computer readable medium of at least one data source, determining whether the content is structured or unstructured; wherein structured content has a first content model associated therewith defining at least a format of the structured content and unstructured content has no model associated therewith. Upon a condition in which the content is structured, incorporating the structured content into the domain model; and, upon a condition in which the content is unstructured, determining by the computer system a second content model to transform the unstructured content into newly structured content and incorporating the newly structured content into the domain model.Type: ApplicationFiled: June 25, 2013Publication date: December 25, 2014Inventors: Daniel Ryan ADAMSON, Victor Fernandovich Comas LIJACHEV
-
Publication number: 20140358975Abstract: A server includes a data ingestion module with executable instructions executed by a processor to produce inferred data types from received data and utilize the inferred data types to augment the received data with computed values that aggregate the received data along a hierarchical dimension.Type: ApplicationFiled: May 30, 2014Publication date: December 4, 2014Applicant: ClearStory Data Inc.Inventors: Vaibhav Nivargi, Nicholas Briggs, Douglas van der Molen, Kevin Malone, Mikhail Bautin
-
Publication number: 20140337390Abstract: Example methods, apparatuses, or articles of manufacture are disclosed that may be implemented using one or more computing devices or platforms to facilitate or otherwise support one or more processes or operations associated with a space-time-node engine signal structure.Type: ApplicationFiled: June 3, 2014Publication date: November 13, 2014Applicant: Space-Time InsightInventor: Krishna Kumar
-
Patent number: 8886679Abstract: Systems and methods of mining Web applications are disclosed. An example method may include selecting a distance measure between subtrees of the Web applications. The method may also include computing an edit distance for the subtrees. The method may also include finding sets of similar subtrees using the edit distance with a Similar Sibling-Trees-aware RTDM (SiSTeR) approach.Type: GrantFiled: October 11, 2011Date of Patent: November 11, 2014Assignee: Hewlett-Packard Development Company, L.P.Inventors: Omer Barkol, Ruth Bergman, Shahar Golan
-
Patent number: 8886580Abstract: Aggregate scoring is used to help classify digital content such as content uploaded to multi-user websites (e.g., social networking websites). In one embodiment, specific categories are used that relate to a social implication of content. For example, text, images, audio or other data formats can provide communication perceived to fall into categories such as violent, abusive, rights management, pornographic or other types of communication. The categories are used to provide a raw score to items in various groupings of a site's content. Where items are related to other items such as by organizational, social, legal, data-driven, design methods, or by other principles or definitions, the related items' raw scores are aggregated to achieve a score for a particular grouping of items that reflects, at least in part, scores from two or more of the related items.Type: GrantFiled: February 28, 2013Date of Patent: November 11, 2014Assignee: Lithium Technologies, Inc.Inventors: Pierre Grenier, Eddie Lo, Satinderpal Sandhu
-
Patent number: 8880732Abstract: Method and system for application isolation is provided. A computing system executes a plurality of processor executable applications and each application is assigned an address by an adapter that is operationally coupled to the computing system. The adapter also assigns a unique identifier to each construct generated by the plurality of applications; maintains a data structure associating the address with the unique identifier assigned to each construct; receives a request having an address and a unique identifier from an application from among the plurality of applications; retrieves the address and the unique identifier from the request; compares the retrieved address and the unique identifier with information maintained by the data structure; and processes the request if the retrieved address and the unique identifier matches with information of the data structure.Type: GrantFiled: February 25, 2011Date of Patent: November 4, 2014Assignee: QLOGIC, CorporationInventor: Kanoj Sarcar
-
Publication number: 20140317155Abstract: A data source, such as a web page, a locally retrieved document, user-entered information, etc., is made visible to a user via a display, such as a computer monitor or touch-screen tablet or smart phone screen. A data capture window, which may be in the form of a data grid, is also displayed to the user, who can select data items from the data source such that they are represented in the data grid. Some data items may also be identified and collected automatically. Data collected into the capture window is then associated with corresponding portions of records in a data base.Type: ApplicationFiled: May 15, 2014Publication date: October 23, 2014Applicant: SEARCHISTICS LLCInventors: VARDA TREIBACH-HECK, IAN D. HART
-
Patent number: 8868621Abstract: The Computer-implemented system, method or computer program that creates a data table of rows and columns from an HTML Web page or document independent of the HTML markup tags. Data embedded in the HTML is identified using clustering of text and extracted into a data table. The generation of data tables can be performed in real-time and is not subject to problems with malformed or poorly created HTML.Type: GrantFiled: October 20, 2011Date of Patent: October 21, 2014Assignee: Rillip, Inc.Inventors: Ray J. D'Onofrio, II, Ray J. D'Onofrio
-
Patent number: 8862636Abstract: A computer-implemented method for use with an extensible markup language (XML) document includes inputting a high-level mapping specification for a schema mapping; and generating a target XML document based on the mapping. The method may perform schema mapping-based XML transformation as a three-phase process comprising tuple extraction, XML-fragment generation, and data merging. The tuple extraction phase may be adapted to handle streamed XML data (as well as stored/indexed XML data). The data merging phase may use a hybrid method that can dynamically switch between main memory-based and disk-based algorithms based on the size of the XML data to be merged.Type: GrantFiled: June 5, 2008Date of Patent: October 14, 2014Assignee: International Business Machines CorporationInventors: Wook-Shin Han, Ching-Tien Ho, Haifeng Jiang, Lucian Popa
-
Patent number: 8856186Abstract: Methods, systems, and computer program products for memory garbage collection via object grouping are disclosed. Normative objects are allocated in a non-native memory referencing respective one or more native objects in a native memory. The non-native objects are grouped into non-native object groups based upon information regarding object references associated with respective ones of the native objects. Objects in respective ones of the one or more non-native object groups are provided one or more references to other said objects of the non-native object groups to create connected object groups. Unused ones of the connected object groups are detected. And objects of the detected unused ones of the connected object groups are de-allocated.Type: GrantFiled: June 29, 2011Date of Patent: October 7, 2014Assignee: Google Inc.Inventors: Peng Li, Dominic Cooney
-
Patent number: 8825716Abstract: Business intelligence, decision support and knowledge management network systems and methods provide efficient, robust, and business-friendly services for the rapid analysis of massive amounts of business, electronic, and other disparate data into actionable intelligence. An advantageous element of the multi-party knowledge network allows the aggregation of common data formats in order to analyze a combined dataset consisting of information from multiple parties, thereby providing additional business intelligence than with a single set of data alone.Type: GrantFiled: March 6, 2014Date of Patent: September 2, 2014Assignee: Cross Commerce Media, Inc.Inventors: Russell William Martin, Jr., Michael Martinov, Heidi Messer, Stephen Messer
-
Patent number: 8819087Abstract: Systems and methods for assisting one or more users in the production of a media work from a plurality of recordings are disclosed. In an exemplary embodiment, digital versions of the recordings are stored on a file-storage server accessible over a communications network. A media-library server provides an organized way for users to access and view the digital versions, and to associate various data files with the digital versions, such as transcription files, logging files, and storyboard documents. The exemplary media library generates web pages that enable users to search and access the digital versions from an internet browser running on a conventional personal computer. The browser also enables users to view the digital versions with a browser-compatible media viewer.Type: GrantFiled: September 5, 2012Date of Patent: August 26, 2014Assignee: Thomas Majchrowski & Associates, Inc.Inventor: Keri DeWitt
-
Patent number: 8812562Abstract: Embodiments described herein provide numerous applications and implementations of a social network to facilitate individuals to resolve various life issues. These issues may include issues that arise when individuals or families relocate, including logistic problems, assimilation of family members in a community, and roommate pairings. As will be described, embodiments described herein greatly facilitate corporations in relocating their employees logistically, and also assist employees and their families with life issues that may determine whether the employees' relocation will be a success.Type: GrantFiled: June 30, 2011Date of Patent: August 19, 2014Inventor: Emily White
-
Patent number: 8812643Abstract: Encoding and/or decoding of messages. On the encoding end, a composite encoder encodes message from an internal format that is used by internal system components into an external format. However, the composite encoder may encode the outgoing messages into different external formats on a per-message basis. For incoming message, a composite decoder decodes incoming messages from any one of a plurality of external formats into the internal format also on a per-message basis. A per-message report mechanism permits internal system components and the encoding/decoding components to communicate information regarding the encoding or decoding on a per message basis. XML messages can be converted automatically into Binary messages.Type: GrantFiled: July 5, 2011Date of Patent: August 19, 2014Assignee: Microsoft CorporationInventors: Stephen Jared Maine, Michael J. Coulson, Tirunelveli R. Vishwanath, Erik B. Christensen
-
Patent number: 8782101Abstract: Methods and systems for transferring data across incompatible database platforms are provided. A method may include reading data from a database export file containing a representation of objects and data from a source database of a first database platform, where the file was generated by an export utility of the first database platform, has a format specific to the first database platform, and is stored outside of the source database. The data read from the file is parsed based on the format. At least one database table represented in the file is identified based on the parsing. Row-level data associated with the at least one database table is also identified based on the parsing. The row-level data is collected in a buffer and is then inserted into a destination database of a second database platform using at least one dedicated data insertion thread.Type: GrantFiled: January 20, 2012Date of Patent: July 15, 2014Assignee: Google Inc.Inventor: Matthew Aaron Moore