Patents Examined by Xiaoqin Hu
  • Patent number: 10878003
    Abstract: A system and method for extracting structured information from an implicit table is disclosed. The system and method provide a way to locate and categorize structured information from an implicit table. More specifically, the system and method provide a way of determining which part of an input image document includes a dominant table and which parts of the dominant table make up rows and columns. These details give meaning to the structured information of the implicit table. These details can be used to properly place the structured information from the implicit table into a two-dimensional data structure, such as a data structure in a relational database. In other words, the structured information from a scanned or digital Portable Document Format (PDF) document can be extracted and placed into a useful format, such as a relational database.
    Type: Grant
    Filed: October 29, 2018
    Date of Patent: December 29, 2020
    Assignee: Accenture Global Solutions Limited
    Inventors: Abhisek Mukhopadhyay, Shubhashis Sengupta
  • Patent number: 10877730
    Abstract: A method, system and computer-usable medium are disclosed for preserving temporal relevance of content within a corpus. A corpus is processed to generate temporally-relevant metadata and mined content, which in turn are processed to generate first temporal relevancy metrics. The cache history of a web browser is likewise processed to generate second temporal relevancy metrics, which in turn is processed with the first temporal relevancy metrics to generate first temporal relevancy scores. New documents are ingested into the corpus and existing documents are revised. Temporally-relevant metadata and mined content associated with the updated corpus are then processed to generate third temporal relevancy metrics. The second and third temporal relevancy metrics are then processed to generate second temporal relevancy scores, which is then used to provide a temporally-relevant response to a query.
    Type: Grant
    Filed: September 21, 2016
    Date of Patent: December 29, 2020
    Assignee: International Business Machines Corporation
    Inventors: Sadanand R. Bajekal, Lakshminarayanan Krishnamurthy, Niyati Parameswaran
  • Patent number: 10824631
    Abstract: An information viewing system that allows a viewer to efficiently display its desired posted article by a simple operation. According to the present invention, a front server includes an information page creating unit that creates a plurality of hierarchical information pages and search result pages, using data items stored in the information database, a posted page creating unit that extracts from the posted article database, a posted article associated with an information page displayed on the communication terminal and a posted article associated with a lower-order information page of the information page displayed, to create a posted page corresponding to the information page displayed, and a communication processing unit that sends the information page and the posted page to the communication terminal to display therein.
    Type: Grant
    Filed: January 15, 2016
    Date of Patent: November 3, 2020
    Inventors: Yoshimori Rikukawa, Noboru Shinohara
  • Patent number: 10783128
    Abstract: A computer-implemented method and a computer system are proposed. According to the method, content and a rule are obtained. The rule specifies a condition associated with the content and an action to be performed if the condition is satisfied. A profile, instructions and chain code are determined at least based on the rule. The profile is used for recording a result of the action. The instructions are used for monitoring whether the condition is satisfied and for updating the profile. The chain code is used for operating a blockchain. In response to the condition being satisfied, the profile is updated and the updated profile is stored in association with the content and the rule into the blockchain by the chain code.
    Type: Grant
    Filed: July 13, 2017
    Date of Patent: September 22, 2020
    Assignee: International Business Machines Corporation
    Inventors: Guo Qiang Li, June-Ray Lin, Chih-Hsiung Liu, Hao-Ting Shih
  • Patent number: 10754902
    Abstract: An information processing system includes a processing device communication with an information terminal. The system includes an input unit receiving user input, a display unit, a screen transition unit which decides a screen to be displayed and performs a screen transition based on the input of the input unit and the display unit, a screen transition score calculation unit which calculates a screen transition score for each screen transition, an item database which associates and stores an item name and attribute, an attribute score database which associates and stores the item attribute and an attribute score, an item search unit which searches the item database based on the user's input, and extracts the item attribute, and a score addition unit which adds the screen transition score calculated by the score calculation unit to the attribute score associated with the item attribute extracted by the item search unit.
    Type: Grant
    Filed: October 15, 2015
    Date of Patent: August 25, 2020
    Assignee: CLARION CO., LTD.
    Inventors: Masao Ishiguro, Akira Tanaka, Kazushige Hiroi
  • Patent number: 10747774
    Abstract: A method may relate generally to data science and data analysis, and computer software and systems, to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets. User interfaces may be implemented as computerized tools for presenting summarization of dataset attributes to facilitate discovery, formation, and analysis of interrelated collaborative datasets. A method may include receiving data resulting from insight calculations. Insight calculations may be based on a derived dataset attribute. Also, the method may include presenting a data arrangement overview summarizing the data attributes as an aggregation of data attributes in a portion of the user interface. The data arrangement overview may include an interactive display of a distribution associated with a collaborative atomized dataset.
    Type: Grant
    Filed: March 9, 2017
    Date of Patent: August 18, 2020
    Assignee: data.world, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Jon Loyens, Bryon Kristen Jacob
  • Patent number: 10719405
    Abstract: Methods and systems for efficiently capturing snapshots of a computing application or environment over time and transferring the snapshots to an integrated data management and storage system are described. A snapshot agent may detect that one or more electronic files associated with the computing application or environment are greater than a threshold file size and in response perform an incremental backup optimization in which the snapshot agent may identify files that have been touched since a previous snapshot by accessing file system metadata (e.g., last modified timestamps) or utilizing a tracking agent to detect potential file changes that have occurred since the previous snapshot was captured. The snapshot agent may then generate fingerprints for data blocks of the touched files, which may reduce the total number of fingerprints needing to be generated to identify the changed data blocks corresponding with a current snapshot to be transferred.
    Type: Grant
    Filed: December 11, 2017
    Date of Patent: July 21, 2020
    Assignee: RUBRIK, INC.
    Inventors: Jiangbin Luo, Biswaroop Palit, Guilherme Vale Ferreira Menezes
  • Patent number: 10701154
    Abstract: Sharding a data unit across storage locations (e.g., data centers, clusters, accounts, and so forth). For each location, the multi-link traffic constraints in data channeling with the storage location are identified. For instance, for a given data channel leading to and/or from a storage location, there may be multiple links. Each link may have its own constraints (e.g., in throughput and/or storage). Policy may consider the sharding of files in a manner that honors the constraints. The policy may also be set so that all portions of the shards may be timely accessed in parallel. When sharding, the policy is applied in order to identify initial placement of the computing data unit across the multiple storage locations. Furthermore, the policy may be consulted to determine which storage location to access the shard from and/or which channel to use to access the shard from a given location.
    Type: Grant
    Filed: May 22, 2017
    Date of Patent: June 30, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Sri Sai Kameswara Pavan Kumar Kasturi, Baskar Sridharan, Spiro Michaylov, Joseph M. Gagne, Raghunath Ramakrishnan, Aditya Telidevara
  • Patent number: 10691681
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing local distribution heuristics. One of the methods includes receiving an original expression having a conjunction comprising a context term and a disjunction of a plurality of other terms. The context term and the disjunction are evaluated according to one or more local distribution heuristics to determine that the context term is a candidate for distribution across the disjunction of the plurality of other terms. In response, a transformed expression is generated in which the candidate context term is distributed across the disjunction of the plurality of other terms.
    Type: Grant
    Filed: September 21, 2017
    Date of Patent: June 23, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Aditya Sharad, Alexander Eyers-Taylor, Michael Peyton Jones
  • Patent number: 10678447
    Abstract: Containerization of a block storage service. A scheduler component of a block storage service is initiated in a first container via a containerization technology. A log volume maintained by a host operating system is mounted to the first container to allow the scheduler component access to the log volume. An application programming interface (API) component of the block storage service is initiated in a second container via the containerization technology. The log volume is mounted to the second container to allow the API component access to the log volume of the host operating system. A volume component of the block storage service is initiated in a third container via the containerization technology. A device volume is mounted to the third container to allow the volume component access to the device volume of the host operating system.
    Type: Grant
    Filed: July 13, 2017
    Date of Patent: June 9, 2020
    Assignee: Red Hat, Inc.
    Inventor: Ryan Hallisey
  • Patent number: 10671621
    Abstract: A compute cloud hosts a distributed application and is configured to add or remove instances of the application at datacenters at disperse geographic regions. Operations of the application are captured in a telemetry stream. Each record in the telemetry stream indicates a time, client location, and performance measure for a corresponding client request. After pre-processing such as rounding the times, the telemetry records are passed to a frequent itemset mining algorithm that identifies frequent time-location pairs in the telemetry stream. The frequent time-location pairs are consolidated into encompassing frequent region time-range pairs. An aggregate performance measure is computed from the performance measures of the telemetry records that match a frequent region time-range pair. A recommended region and time for adding or removing instances of the application is computed based on the aggregate performance measure and the region time-range pair.
    Type: Grant
    Filed: December 8, 2017
    Date of Patent: June 2, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Adam Ungar, Benjamin Mark Zhitomirsky
  • Patent number: 10650068
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer-storage medium, for using a search engine implemented on a user device to identify topics or tasks associated with native application content. The method may include actions of receiving a set of data that is generated by the native application and that includes (i) native application content, and (ii) context information associated with the native application content, generating a cluster feature-vector representation based on the set of data, storing the cluster feature-vector representations in a search engine index on the user device, identifying a set of cluster feature-vector representations that are associated with a particular feature-vector representation, identifying one or more tasks that are associated with the set of cluster feature-vector representations, and providing, for output, a user interface including a selectable control that identifies one or more of the tasks.
    Type: Grant
    Filed: January 9, 2017
    Date of Patent: May 12, 2020
    Assignee: Google LLC
    Inventors: Timo Mertens, Maxim Gubin
  • Patent number: 10645548
    Abstract: A method may relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets. One or more computerized tools may be configured to discover, form, and analyze via one or more layered data files, interrelations among a system of networked collaborative datasets. A method may include transforming of a set of data to an atomized format to form an atomized dataset that includes a derived dataset attribute. The method may also include presenting data representing an annotation at the user interface based on the derived dataset attribute. An annotation may be associated with a layer file.
    Type: Grant
    Filed: March 9, 2017
    Date of Patent: May 5, 2020
    Assignee: data.world, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
  • Patent number: 10643031
    Abstract: A system and method for document recommendation combines linguistic and term frequency approaches in order to improve overall performance and recommendation. Hypernyms are added to the source document set so that related documents can be recommended from the source document set. There are a variety of different parameters to configure that change operation of the document recommendation system.
    Type: Grant
    Filed: March 10, 2017
    Date of Patent: May 5, 2020
    Assignee: UT-Battelle, LLC
    Inventors: Thomas E. Potok, Robert M. Patton, Sreenivas R. Sukumar
  • Patent number: 10635686
    Abstract: Systems and methods for implementing extensible attributes in ETL are disclosed. In some examples, attributes configured at a source file may be extracted from the source file. The extracted attributes can be mapped to a target column of a data warehouse table, and then a dynamic ETL script may be generated. The dynamic script may be executed to move data associated with the attributes to an appropriate new column of the data warehouse.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: April 28, 2020
    Assignee: Oracle International Corporation
    Inventors: Dylan Wan, Francoise J. Lawrence, Justin Hyde, Amit Goyal, Saurabh Verma, John D. Poole
  • Patent number: 10621492
    Abstract: The present disclosure relates to a method for centrally processing data records using a record linkage algorithm. The method comprises providing a centralized master repository for storing data records in a predefined data structure having a set of attributes. At least one clustering metric is provided. Clusters of records may be determined using a clustering function that is based on the at least one clustering metric. For each particular cluster, a set of configuration data for the record linkage algorithm may be defined based on a value of the clustering metric within that particular cluster. The individual data records may be assigned to one or more clusters of the clusters using the clustering metric values and the record linkage algorithm may be applied to a set of two or more individual data records assigned to at least one common cluster using the set of configuration data for the common cluster.
    Type: Grant
    Filed: October 21, 2016
    Date of Patent: April 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Yannick Saillet, Scott Schumacher, Jens P. Seifert
  • Patent number: 10621493
    Abstract: The present disclosure relates to a method for centrally processing data records using a record linkage algorithm. The method comprises providing a centralized master repository for storing data records in a predefined data structure having a set of attributes. At least one clustering metric is provided. Clusters of records may be determined using a clustering function that is based on the at least one clustering metric. For each particular cluster, a set of configuration data for the record linkage algorithm may be defined based on a value of the clustering metric within that particular cluster. The individual data records may be assigned to one or more clusters of the clusters using the clustering metric values and the record linkage algorithm may be applied to a set of two or more individual data records assigned to at least one common cluster using the set of configuration data for the common cluster.
    Type: Grant
    Filed: January 2, 2018
    Date of Patent: April 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Yannick Saillet, Scott Schumacher, Jens P. Seifert
  • Patent number: 10620835
    Abstract: A Windows file system can be attached to a remote non-Windows disk stack thereby allowing a mass storage device to be redirected at the disk level even though a client terminal is running a non-Windows operating system. A client-side proxy can include a disk provider that is configured to obtain disk information from a mass storage device connected to the client terminal and provide it to the server-side agent. A virtual disk enumerator on the server can employ the disk information to emulate a disk stack so that a Windows specific file system can be loaded on the server. Any Windows-specific management commands that target the mass storage device can be handled by the virtual disk enumerator using the disk information, whereas any access commands can be routed to the disk provider which can interface with a non-Windows disk stack on the client terminal for handling.
    Type: Grant
    Filed: January 27, 2017
    Date of Patent: April 14, 2020
    Assignee: Wyse Technology L.L.C.
    Inventors: Gokul Thiruchengode Vajravel, Ramanujam Kaniyar Venkatesh
  • Patent number: 10606850
    Abstract: A mechanism is provided for updating a knowledge base of a spam detection system for email messages. A first and a second email set of email messages related to a time window are created. The first email set comprises email messages currently classified as spam messages, the second email set email messages currently classified as non-spam messages. A natural language term that is not present in the knowledge base is extracted from one of the email messages contained in one of the two email sets. Based on a frequency of occurrence of the extracted term in email messages contained in one or more of the two email sets and based on whether the respective email messages are spam messages or non-spam messages, a score value of the extracted term is calculated. The extracted term and the calculated score value are stored into the knowledge base.
    Type: Grant
    Filed: September 21, 2017
    Date of Patent: March 31, 2020
    Assignee: International Business Machines Corporation
    Inventors: Michele Crudele, Antonio Perrone