Generating An Index Patents (Class 707/741)
  • Patent number: 10985902
    Abstract: Described herein is an efficient, dynamic Symmetric Searchable Encryption (SSE) scheme. A client computing device includes a plurality of files and a dictionary of keywords. An index is generated that indicates, for each keyword and each file, whether a file includes a respective keyword. The index is encrypted and transmitted (with encryptions of the files) to a remote repository. The index is dynamically updateable at the remote repository, and can be utilized to search for files that include keywords in the dictionary without providing the remote repository with information that identifies content of the file or the keyword.
    Type: Grant
    Filed: December 5, 2014
    Date of Patent: April 20, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Seny Fakaba Kamara, Charalampos Papamanthou
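The keyword-by-file index described in this abstract can be pictured with a small plaintext sketch. It shows only the index layout (for each dictionary keyword, one flag per file). Encrypting the index and the files, which is the core of the scheme, is deliberately omitted. The names `build_index` and `search` and the whitespace/word tokenization are assumptions made for illustration, not details from the patent.

```python
import re
from typing import Dict, List


def build_index(files: Dict[str, str], dictionary: List[str]) -> Dict[str, List[bool]]:
    """Return, for each keyword, one flag per file (files in sorted-name order)."""
    names = sorted(files)
    # Tokenize each file once into a set of lowercase words (illustrative choice).
    tokens = {name: set(re.findall(r"\w+", files[name].lower())) for name in names}
    return {kw: [kw.lower() in tokens[name] for name in names] for kw in dictionary}


def search(index: Dict[str, List[bool]], files: Dict[str, str], keyword: str) -> List[str]:
    """List the files whose flag is set for the given keyword."""
    names = sorted(files)
    return [name for name, hit in zip(names, index.get(keyword, [])) if hit]
```

In a real searchable-encryption deployment the rows and lookups would be encrypted so that the repository learns neither the keyword nor the file contents; this sketch leaves that part out entirely.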
  • Patent number: 10983954
    Abstract: Time-series columnar-based information is received and indexed in a compute infrastructure for cost-effective cloud-based object storage. The approach leverages a file format that enables highly-performant search and retrieval of the data stored in the cloud. In operation, an indexer receives the time-series information, indexes that information according to the file format, and forwards the indexed information for storage to the object store, where it is stored as a set of time-based partitions. A partition comprises a set of files, namely, a manifest file, a data file, and an index file. These files are structured as a compact instance of a set of raw unstructured data that comprises the given partition. Highly-performant information retrieval is enabled in response to a time-bounded query, because operations at a query peer (with respect to one or more partitions) are carried out in real-time during query processing and without requiring retrieval of the data file as a whole.
    Type: Grant
    Filed: May 22, 2020
    Date of Patent: April 20, 2021
    Assignee: Hydrolix Inc.
    Inventor: Hasan Alayli
  • Patent number: 10978178
    Abstract: A system and a method are disclosed for searching and ranking molecules based on specificity. To this end, a processor receives a request to search for molecules that correspond to biological features, and generates a mapping of molecules to the biological features by searching publications for a reference to a biological feature in connection with a molecule, and responsively adding to the mapping any found references. The processor determines a respective specificity score for each respective molecule of the plurality of molecules by determining which of the plurality of biological features are mapped to each respective molecule, and for each such respective biological feature, in response to determining that there is not more than one concrete instance that is mapped to the respective molecule, incrementing the respective specificity score by a unit. The processor generates a ranking based on the respective specificity scores and outputs the ranking.
    Type: Grant
    Filed: January 16, 2019
    Date of Patent: April 13, 2021
    Assignee: MERCK SHARP & DOHME CORP.
    Inventors: Ahmed Abdeen Hamed, Agata Leszczynska
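The scoring rule in this abstract can be sketched as follows, assuming the molecule-to-feature mapping has already been built from the publication search. The nested-dictionary layout, the function names, and the tie-break by molecule name are assumptions for illustration only.

```python
from typing import Dict, List, Set, Tuple

# molecule -> biological feature -> concrete instances found for that feature
Mapping = Dict[str, Dict[str, Set[str]]]


def specificity_scores(mapping: Mapping) -> Dict[str, int]:
    """Add one unit per mapped feature that has at most one concrete instance."""
    scores: Dict[str, int] = {}
    for molecule, features in mapping.items():
        score = 0
        for instances in features.values():
            if len(instances) <= 1:  # "not more than one concrete instance"
                score += 1
        scores[molecule] = score
    return scores


def rank_molecules(mapping: Mapping) -> List[Tuple[str, int]]:
    """Order molecules by descending score; ties are broken by name (assumption)."""
    scores = specificity_scores(mapping)
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
```

For example, a molecule linked to exactly one gene and one disease would score 2, while a molecule linked to two genes and one disease would score 1.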
  • Patent number: 10977114
    Abstract: Disclosed herein are computer-implemented methods, computer-implemented systems, and non-transitory, computer-readable media, to index blockchain data for storage. One computer-implemented method includes generating one or more encoded blocks by executing error correction coding (ECC) on one or more blocks of a blockchain. Each of the one or more encoded blocks is divided into a plurality of datasets. An index is provided for the one or more encoded blocks, where the index is used to index each dataset of the plurality of datasets to a blockchain node at which a respective dataset is stored.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: April 13, 2021
    Assignee: Alipay (Hangzhou) Information Technology Co., Ltd.
    Inventor: Haizhen Zhou
  • Patent number: 10977324
    Abstract: Enable using a user computer to search the user's personal content items hosted by an online content management service, the online content management service having an indexer computer configured to generate a base index of the user's content items hosted by the service, a base index server computer configured to transmit the generated base index over a network to the user computer, an index mutation generator computer configured to generate index mutations for subsequent changes to the user's hosted content items, and an index mutation server computer configured to transmit the generated index mutations over the network to the user computer. In one aspect, the system and method enable searches of the user's hosted content items and changes thereto to be conducted at the user computer using an index local to the user computer constructed based on the base index and the index mutations received by the user computer.
    Type: Grant
    Filed: February 26, 2018
    Date of Patent: April 13, 2021
    Assignee: Dropbox, Inc.
    Inventors: Samir Goel, Franck Chastagnol, Abhishek Agrawal
  • Patent number: 10970260
    Abstract: Techniques are provided for moving data between partitions. Such a process may be performed without requiring any locks that block transactions that target a partition from being executed. Instead, such transactions may proceed while a move operation is being performed. The move operation involves copying data from the targeted partition to another partition that is hidden from (or “invisible” to) those transactions that attempt to read from or write to the partition. During the move operation, changes that are made to the partition are also reflected in a journal. Eventually, the changes reflected in the journal are drained and applied to the hidden partition. Once the partition and the hidden partition are synchronized, the identities of the partitions are swapped so that future transactions will target the previously-hidden partition instead of the previously-viewable partition.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: April 6, 2021
    Assignee: Oracle International Corporation
    Inventors: Ananth Raghavan, George Eadon, Ramesh Kumar, Hermann Baer, Suresh Sridharan
  • Patent number: 10970316
    Abstract: Techniques are provided for maintaining a class hierarchy of an object-oriented data model in tables of a relational database that minimize performance impact (e.g., in terms of both processing resources and storage footprint). In one embodiment, an object-relational mapping system accesses a class hierarchy of an object-oriented data model. A plurality of tables of a relational database are generated, including a primary table, a joined table and an overflow table. The object-relational mapping system maps classes of the class hierarchy to the primary table, joined table and overflow table, wherein the mapping maps properties of classes to columns of the primary table and joined table until a column threshold is reached and then maps excess properties to columns of the overflow table, and the mapping shares at least one column of the primary table, joined table or overflow table among classes such that multiple different properties are mapped to the shared column.
    Type: Grant
    Filed: October 10, 2018
    Date of Patent: April 6, 2021
    Assignee: Bentley Systems, Incorporated
    Inventors: Affan Khan, Krischan Eberle
  • Patent number: 10965602
    Abstract: Examples include a computing system having a load balancer circuitry to assign data units to destinations using a hash function having a plurality of hash bins, each hash bin being associated with a destination. The computing system includes a hash adjuster to assign a destination to each of the plurality of hash bins, sample assignment of data units to hash bins by the load balancer circuitry over a sample window, analyze the sampled assignments, and reassign destinations to the hash bins based at least in part on the analyzed sampled assignments. This results in increased system performance in processing of the data units by the load balancer.
    Type: Grant
    Filed: March 14, 2019
    Date of Patent: March 30, 2021
    Assignee: Intel Corporation
    Inventors: Carl Geoffrion, Simon Perron Caissy, Eduardo L. Romero
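The sample-analyze-reassign loop in this abstract can be sketched in software, although the patent describes load balancer circuitry. The abstract does not say how the sampled assignments are analyzed, so the greedy "heaviest bin to least-loaded destination" policy below is a swapped-in assumption, as are the CRC32 hash and all function names.

```python
import zlib
from collections import Counter
from typing import Iterable, List, Sequence


def bin_of(key: bytes, num_bins: int) -> int:
    """Map a data unit's key to a hash bin (CRC32 is an illustrative choice)."""
    return zlib.crc32(key) % num_bins


def sample_window(keys: Iterable[bytes], num_bins: int) -> List[int]:
    """Count how many sampled data units landed in each bin during the window."""
    counts = Counter(bin_of(k, num_bins) for k in keys)
    return [counts.get(b, 0) for b in range(num_bins)]


def reassign(bin_counts: Sequence[int], destinations: Sequence[str]) -> List[str]:
    """Greedy rebalancing: heaviest bins first, each to the least-loaded destination."""
    load = {d: 0 for d in destinations}
    table = [""] * len(bin_counts)
    for b in sorted(range(len(bin_counts)), key=lambda i: -bin_counts[i]):
        target = min(destinations, key=lambda d: load[d])
        table[b] = target
        load[target] += bin_counts[b]
    return table
```

With sampled counts `[10, 1, 1, 8]` and two destinations, this policy puts the heaviest bin alone on one destination and the other three bins on the second, giving each a load of 10.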
  • Patent number: 10958598
    Abstract: The present disclosure discloses a method and apparatus for generating a candidate reply message. A specific embodiment of the method comprises: acquiring a text message of a currently received conversation; determining whether a pre-established conversation template matching the text message exists; and generating, in response to determining that a conversation template matching the text message exists, a candidate reply message on the basis of the conversation template. According to the method provided by embodiments of the present disclosure, when the text message of the currently received conversation matches the pre-established conversation template, the candidate reply message is automatically generated, and a user may click the candidate reply message for reply, so that the time for the user to edit a reply message is saved, a period for the user to communicate with others is reduced, and the communication efficiency is improved.
    Type: Grant
    Filed: August 6, 2018
    Date of Patent: March 23, 2021
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Minqiang Chang, Kefeng Meng, Liang He, Ming Ma
  • Patent number: 10956091
    Abstract: A method begins when a read expansion condition is met for a set of encoded data slices stored in a set of storage units of a first DSN. The method continues with determining whether a difference between a pillar width number and a read threshold (RT) number is greater than or equal to the RT number. When no, the method continues with retrieving an expansion number of encoded data slices from the set of storage units and sending them to storage units of a second DSN for storage. The method continues by receiving a plurality of read requests for the set of encoded data slices, sending a first group of read requests to a RT number of storage units of the first DSN and sending a second group of requests to a second subset of storage units of the first DSN and to the storage units of the second DSN.
    Type: Grant
    Filed: January 24, 2019
    Date of Patent: March 23, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Wesley B. Leggette, Manish Motwani, Brian F. Ober, Jason K. Resch
  • Patent number: 10956401
    Abstract: A method, computer system, and computer program product may include the steps of processing at least a part of the technical document to extract a term denoting a given element of the software program product, performing a respective check on whether the extracted term is correspondingly recorded in at least one of an accessible ontology and an accessible synonym database, whereby each ontology entry comprises a corresponding preferred term in respect of the given element of the software program product, and the accessible synonym database comprises respective entries of the corresponding preferred term, and a synonym corresponding to the corresponding preferred term, in respect of the given element of the software program product, and updating an accessible notification database according to a given check performed against at least one of the accessible ontology database and the accessible synonym database.
    Type: Grant
    Filed: November 28, 2017
    Date of Patent: March 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Kevin Brown, Melita Saville, David Wright
  • Patent number: 10951489
    Abstract: Compliance with a Service Level Agreement (SLA) between an ecommerce provider and its customer may be determined using real user monitoring by collecting data for each HTTP/HTTPS request and response sent to the ecommerce provider. The presence of a content delivery network (CDN), also monitoring every HTTP/HTTPS request sent from a user through the CDN, provides an external, third-party confirmation of compliance. Real user monitoring is collected at the SaaS system by triggering an event rule, collecting data regarding the performance of the request and the response, and logging the created message. The log is processed and stored in a performance monitoring system where it is indexed for calculations and presentation.
    Type: Grant
    Filed: December 29, 2016
    Date of Patent: March 16, 2021
    Assignee: DIGITAL RIVER, INC.
    Inventors: Mark Aron Beltt, Ryan Patrick Douglas, Chad Martin, Marcus Mechmann, Michael Edwin Borchert
  • Patent number: 10949398
    Abstract: An information management system is provided herein that combines data backup and data migration operations such that data appears available in a network-accessible folder when in fact the data is stored as a secondary copy in a secondary storage device. For example, a user can indicate that a first file should be added to the network-accessible folder. A client computing device can transmit the first file to a secondary storage computing device that performs a backup operation to store a backup copy of the first file in the secondary storage device. The secondary storage computing device can also generate an index of the first file, which includes a location of the backup copy of the first file, and transmit the index to a server that manages the network-accessible folder. Thus, the backup copy of the first file can be retrieved if the first file is selected via the network-accessible folder.
    Type: Grant
    Filed: March 27, 2018
    Date of Patent: March 16, 2021
    Assignee: Commvault Systems, Inc.
    Inventors: Bhavyan Bharatkumar Mehta, Parag Gokhale, Sri Karthik Bhagi, Prosenjit Sinha, Manoj Kumar Pradhan
  • Patent number: 10949708
    Abstract: Methods, systems, and apparatus, for determining fine-grained image similarity. In one aspect, a method includes training an image embedding function on image triplets by selecting image triplets of first, second and third images; generating, by the image embedding function, first, second and third representations of the features of the first, second and third images; determining, based on the first representation of features and the second representation of features, a first similarity measure for the first image to the second image; determining, based on the first representation of features and the third representation of features, a second similarity measure for the first image to the third image; determining, based on the first and second similarity measures, a performance measure of the image embedding function for the image triplet; and adjusting the parameter weights of the image embedding function based on the performance measures for the image triplets.
    Type: Grant
    Filed: May 22, 2019
    Date of Patent: March 16, 2021
    Assignee: Google LLC
    Inventors: Yang Song, Jiang Wang, Charles J. Rosenberg
  • Patent number: 10942909
    Abstract: A seed value assigned to a database table is determined. Numerically ordered invocation numbers are determined. Unordered pseudorandom numbers are generated based on the seed value and the invocation numbers to index temporally ordered data changes to the database table. It is determined whether the total number of yet-to-be-saved pseudorandom numbers reaches a maximum total number threshold. If so, the yet-to-be-saved pseudorandom numbers and a sequence of corresponding invocation numbers are saved in a sync table.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: March 9, 2021
    Assignee: salesforce.com, inc.
    Inventor: Rohitashva Mathur
  • Patent number: 10936957
    Abstract: Semantic mediation is accomplished by initially constructing a model of domain-specific requirements associated with one or more nodes in a hierarchy of semantically linked nodes representing encoded aspects of a domain. This is followed by determining a target node in the hierarchy for application of at least one property characterized in a topical profile of a first entity, and then by producing a first-entity, target node-specific application model that reflects impacts of properties and characteristics of the topical profile with the model of domain-specific requirements through informatic convolution of the model of domain-specific requirements with the topical profile of the first entity.
    Type: Grant
    Filed: April 7, 2020
    Date of Patent: March 2, 2021
    Assignee: Go Logic Decision Time, LLC
    Inventors: Dennis Paul Ackerman, Stephen Francis Taylor
  • Patent number: 10938725
    Abstract: A computer implemented load balancing multimedia conferencing method may be used to reduce processing time required for secure 2D and/or 3D multimedia network communications, and may include the steps of: enabling communication of multimedia content, having audio data that is associated with video frame data, between a sending end-point and a receiving end-point, the receiving end-point having a receiving multimedia content index; determining a load-balancing procedure that divides a video frame of the multimedia content into a first video frame portion and a second video frame portion, the first and second video frame portions associated with audio data; providing the second video frame portion and associated audio data to the receiving end-point; deriving, via the receiving end-point, a third video frame portion using the receiving multimedia content index; and combining, via the receiving end-point, the second video frame portion and derived third video frame portion into a composite video frame.
    Type: Grant
    Filed: September 26, 2019
    Date of Patent: March 2, 2021
    Assignee: Farm & Home Cooperative
    Inventor: Neil Sleevi
  • Patent number: 10931779
    Abstract: A method for data processing is provided. The method may include: preprocessing initial data to obtain preprocessed data; storing the preprocessed data; receiving a data request made through an application, the data request including information relating to a storage path of contents that are requested; in response to the data request, determining, by a nearby proxy of a first proxy cluster in a first region, whether the contents requested in the data request are cached locally; and in response to a determination that the contents are cached locally, providing, by the nearby proxy, the contents to the application; or in response to a determination that the contents are not cached locally, acquiring, by the nearby proxy, the contents based on the information relating to the storage path of the contents; and providing, by the nearby proxy, the contents to the application.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: February 23, 2021
    Assignee: BEIJING VOYAGER TECHNOLOGY CO., LTD.
    Inventors: Fenglei Wang, Lingang Min
  • Patent number: 10922379
    Abstract: A method for processing electronic data includes the steps of transforming the electronic data to a matrix representation including a plurality of matrices; decomposing the matrix representation into a series of matrix approximations; and processing, with an approximation process, the plurality of matrices thereby obtaining a low-rank approximation of the plurality of matrices.
    Type: Grant
    Filed: September 10, 2018
    Date of Patent: February 16, 2021
    Assignee: City University of Hong Kong
    Inventors: Hing Cheung So, Wen-Jun Zeng, Jiayi Chen, Abdelhak M. Zoubir
  • Patent number: 10922295
    Abstract: Disclosed are a system and method for processing HL7 messages. A method includes receiving an HL7 message; storing the HL7 message in a memory; creating a data structure representation of the stored HL7 message in the memory that stores a start index and an end index of the stored HL7 message, wherein the start index indicates a beginning of the stored HL7 message and the end index indicates an end of the stored HL7 message; hierarchically creating, from the data structure representation of the stored HL7 message, one or more data structure representations in the memory that store at least one start index and at least one end index corresponding to one or more fragments in the stored HL7 message; and modifying the stored HL7 message according to the at least one start index and a corresponding end index of the fragments in the HL7 message.
    Type: Grant
    Filed: February 3, 2020
    Date of Patent: February 16, 2021
    Inventors: Steven E. Dwire, Hong Gao, Sarika Kandaprabhu, Michael Godinez, Hassan Khan
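The hierarchy of start/end indexes in this abstract can be sketched as a tree of spans over one stored string, so that fragments are located without copying them out. The `Span` class, the function names, and the two-level split (segments on carriage return, fields on `|`, the usual HL7 v2 delimiters) are assumptions for illustration. The sketch ignores components, escape sequences, and the special numbering of the MSH segment.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Span:
    start: int  # inclusive offset into the stored message
    end: int    # exclusive offset into the stored message
    children: List["Span"] = field(default_factory=list)


def split_spans(text: str, start: int, end: int, sep: str) -> List[Span]:
    """Return spans for the pieces of text[start:end] separated by sep."""
    spans: List[Span] = []
    pos = start
    while pos <= end:
        nxt = text.find(sep, pos, end)
        if nxt == -1:
            nxt = end
        spans.append(Span(pos, nxt))
        pos = nxt + 1
    return spans


def index_message(msg: str) -> Span:
    """Build root -> segment -> field spans; empty segments are skipped."""
    root = Span(0, len(msg))
    for seg in split_spans(msg, 0, len(msg), "\r"):
        if seg.start == seg.end:
            continue
        seg.children = split_spans(msg, seg.start, seg.end, "|")
        root.children.append(seg)
    return root


def text_of(msg: str, span: Span) -> str:
    """Read a fragment through its indexes."""
    return msg[span.start:span.end]


def replace_field(msg: str, span: Span, new_value: str) -> str:
    """Return the message with the fragment at span replaced by new_value."""
    return msg[:span.start] + new_value + msg[span.end:]
```

After `replace_field`, offsets that follow the edited fragment are stale whenever the new value has a different length, so in this sketch the message would need to be re-indexed before further edits.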
  • Patent number: 10922361
    Abstract: Some examples include displaying a user interface that includes attributes and entities that are determined to be related to an input entity identified by a user. Further, some implementations include displaying a structured data table that identifies attribute values associated with the input entity and selected related entities.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: February 16, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jun Yan, Ning Liu, Lei Ji, Zheng Chen
  • Patent number: 10915974
    Abstract: A variety of real-time information is collected in real time from a plurality of sources/systems. A current real-time context for activity of a consumer is noted. The real-time information is integrated within the current real-time context and activity of the consumer. In an embodiment, the collected real-time information is crowdsourced from a variety of social media platforms. In an embodiment, the real-time information is integrated as a recommendation made to the consumer for a good or service.
    Type: Grant
    Filed: March 28, 2017
    Date of Patent: February 9, 2021
    Assignee: NCR Corporation
    Inventors: Yehoshua Zvi Licht, Joseph Arnold White, Brent Vance Zucker
  • Patent number: 10909179
    Abstract: Analyzing a large number of files to identify malicious software, including evaluating a multigraph, including determining a graph having a plurality of nodes, including a source node and target nodes from a data set, and merging the graph into a multigraph in response to a node score above a threshold level; and, for each target node, determining one or more specificity indexes for the target node and determining a node score for the target node based, at least in part, on a specificity index.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: February 2, 2021
    Assignee: Avast Software s.r.o.
    Inventor: Peter Kováč
  • Patent number: 10909131
    Abstract: Systems and methods are disclosed for efficiently indexing stream data to facilitate full-text search of the stream data. A stream comprises a large amount of data, only some of which is deemed useful for full-text search indexing. An administrator can specify an indexing specification for a stream. The indexing specification can specify one or more sub-streams within the stream for indexing, and/or specify one or more time intervals of stream data for indexing. A query against the stream can specify the indexing specification to use to index the stream before returning results for the query. The query can alternatively specify an indexing specification to apply to a previously indexed stream. Full-text search indexes generated using an indexing specification can return results that are more relevant to a user because the results are more narrowly focused than an index of, e.g., the entire stream.
    Type: Grant
    Filed: April 28, 2017
    Date of Patent: February 2, 2021
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Stephen G. Graham, Aaron W. Spiegel, Paul R. Kingston, Flavio Paiva Junqueira, Dominique Prunier
  • Patent number: 10909614
    Abstract: A method for ranking items may include determining a purchaseability score for each item of a specified type based on a number of impressions of the respective item provided to users during a time period, a quantity of the respective item purchased by the users, and a quantity of the respective item added to shopping carts of the users. The method may further include ranking the items based on the purchaseability scores, generating item feed data indicating an ordering of the items based on the rankings, and providing the item feed data to a user device. The user's device may display a scrollable feed of item panels corresponding to the items, with the item panels ordered according to the ordering of the corresponding items.
    Type: Grant
    Filed: February 6, 2019
    Date of Patent: February 2, 2021
    Assignee: CONTEXTLOGIC, INC.
    Inventors: Piotr Szulczewski, Danny S. Zhang, Tarek Fahmy, Jack Z. Xie, Yuli Ye
  • Patent number: 10902352
    Abstract: A computer generates labels for machine learning algorithms by retrieving, from a data storage circuit, multiple label sets that contain labels that each classify data points in a corpus of data. A graph is generated that includes a plurality of edges, each edge between two respective labels from different label sets of the multiple label sets. Weights are determined for the plurality of edges based upon a consistency between data points classified by two labels connected by the edges. An algorithm is applied that groups labels from the multiple label sets based upon the weights for the plurality of edges. Data points are identified from the corpus of data that represent conflicts within the grouped labels. An electronic message is transmitted in order to present the identified data points to entities for further classification. A new label set is generated using the further classification received from the entities.
    Type: Grant
    Filed: January 6, 2020
    Date of Patent: January 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Prasanta Ghosh, Shantanu R. Godbole, Sachindra Joshi, Srujana Merugu, Ashish Verma
  • Patent number: 10901415
    Abstract: Aspects of the disclosure relate to a system that includes a memory storing a queue for arranging tasks, a plurality of self-driving systems for controlling an autonomous vehicle, and one or more processors. The one or more processors may receive a non-passenger task request with a priority level of the non-passenger task request. When the non-passenger task request is accepted, the one or more processors may insert the task in the queue based on the priority level of the task request. Then, the one or more processors may provide instructions to one or more self-driving systems according to the non-passenger task request. Having received updates of the status of the autonomous vehicle, the one or more processors may determine that the task is completed based on the updates. After determining that the task is completed, the one or more processors may remove the task from the queue.
    Type: Grant
    Filed: May 26, 2015
    Date of Patent: January 26, 2021
    Assignee: WAYMO LLC
    Inventors: Joshua Seth Herbach, Philip Nemec, Peter Colijn
  • Patent number: 10902016
    Abstract: Systems and methods are disclosed for enabling any number of users to share data across any number of web site, mobile, and/or desktop applications (Apps), via an autonomous interdependent repositories (AIR) database. An example method includes grouping data in a plurality of Repositories of the AIR database. The plurality of Repositories may be shared with the user and other users. The method also includes maintaining a plurality of Repository records for the plurality of Repositories. Any number of Applications from any number of different publishers use the AIR database by providing schemas to the AIR databases.
    Type: Grant
    Filed: February 12, 2018
    Date of Patent: January 26, 2021
    Inventor: Artem Shamsutdinov
  • Patent number: 10891340
    Abstract: A method and a system for updating a search index divided into a plurality of chunks, each chunk including a portion of a plurality of documents, comprising: receiving a request to update a set of documents including deletion of a first subset of documents, computing for each given chunk of a set of chunks, each given chunk including at least one document of the first subset: an active size of a set of active documents, the set of active documents not including the at least one document, a total size, the total size not including the at least one document and a ratio of the active to the total size, duplicating, based on the ratio, a subset of chunks to obtain a subset of candidate chunks, updating the set of documents in the subset of candidate chunks, replacing the subset of chunks with the subset of candidate chunks.
    Type: Grant
    Filed: July 8, 2019
    Date of Patent: January 12, 2021
    Assignee: YANDEX EUROPE AG
    Inventors: Aleksey Valerievich Shlyunkin, Alexander Pavlovich Fokin, Petr Sergeevich Popov
  • Patent number: 10887640
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing an artificial intelligence framework for generating enhanced digital content and improving digital content campaign design. In particular, the disclosed systems can utilize a metadata neural network, a summarizer neural network, and/or a performance neural network to generate metadata for digital content, predict future performance metrics, generate enhanced digital content, and provide recommended content changes to improve performance upon dissemination to one or more client devices.
    Type: Grant
    Filed: July 11, 2018
    Date of Patent: January 5, 2021
    Assignee: ADOBE INC.
    Inventors: Viswanathan Swaminathan, Somdeb Sarkhel, Saayan Mitra
  • Patent number: 10885619
    Abstract: Methods and systems for selecting an image. The systems and methods may receive one or more images as well as a context associated with the received imagery. Based on the context of the received imagery, an appropriate engine may analyze the imagery by executing one or more routines. Based on the results of the analysis, one or more of the images may be selected.
    Type: Grant
    Filed: October 17, 2018
    Date of Patent: January 5, 2021
    Assignee: Photo Butler, Inc.
    Inventors: Gerald Hewes, Joe Regan, David Benaim, Joseph C. Cuccinelli, Jr., Andrew Goldfarb
  • Patent number: 10878049
    Abstract: A search apparatus includes a request acquisition unit that acquires a search request from a user, a search processing unit that executes a search process on a basis of a search target designated in the search request, an information acquisition unit that acquires user-related information related to the user, a selection unit, and a providing unit. If the search processing unit acquires, as search results, multiple pieces of search result information having mutually different semantic contents, the selection unit selects, from among those pieces, a piece of search result information highly relevant to the user-related information as information expected by the user. The providing unit provides the user with the information expected by the user selected by the selection unit as a search result responding to the search request.
    Type: Grant
    Filed: April 12, 2018
    Date of Patent: December 29, 2020
    Assignee: FUJI XEROX CO., LTD.
    Inventors: Chuan Wang, Takeshi Onishi
  • Patent number: 10878270
    Abstract: Techniques for keypoint-based multi-label word segmentation and localization are described. A machine learning model identifies bounding regions of text within an image, and then generates multiple channel matrices representing predicted keypoints of the text within the bounding regions. The keypoints can be used to rectify the corresponding graphical content from the image including the text to improve the ability to perform optical character recognition and identify the text. Line and word segmentation and localization can be performed together.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: December 29, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Song Cao, Hao Wu, Jonathan Wu, Meng Wang, Rahul Bhotika
  • Patent number: 10860478
    Abstract: A method and an apparatus for storing data are provided. The method includes: acquiring target data, the target data including first key-value pair data, object data or array data, the key-value pair data including key data and value data, the object data including second key-value pair data, and the array data including object data; determining an object data number of the object data included in the target data, an array data number of the array data included in the target data, a first key-value pair data number of the first key-value pair data included in the target data, and a second key-value pair data number of the second key-value pair data included in the target data; determining a storage space based on the determined numbers; and storing the target data into the storage space.
    Type: Grant
    Filed: September 18, 2018
    Date of Patent: December 8, 2020
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Yang Wang, Xunchao Song, Pengcheng Yuan, Xiaobo Liu, Xi Chen
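The counting step in the abstract above can be illustrated with a minimal sketch. It assumes the target data is an already-parsed JSON-like dictionary, treats the top-level pairs as the "first" key-value pairs and pairs inside nested objects as the "second" ones, and uses a made-up per-element byte cost (`SLOT_BYTES`) to size the storage space. None of these names or costs come from the patent.

```python
from typing import Any, Dict

# Hypothetical per-element storage cost in bytes; not taken from the patent.
SLOT_BYTES = {"objects": 16, "arrays": 16, "first_pairs": 8, "second_pairs": 8}


def count_elements(target: Dict[str, Any]) -> Dict[str, int]:
    """Count nested objects, arrays, top-level pairs, and pairs inside objects."""
    counts = {
        "objects": 0,
        "arrays": 0,
        "first_pairs": len(target),  # pairs held directly by the target data
        "second_pairs": 0,           # pairs held by nested objects
    }

    def visit(value: Any) -> None:
        if isinstance(value, dict):
            counts["objects"] += 1
            counts["second_pairs"] += len(value)
            for child in value.values():
                visit(child)
        elif isinstance(value, list):
            counts["arrays"] += 1
            for child in value:
                visit(child)

    for value in target.values():
        visit(value)
    return counts


def storage_size(counts: Dict[str, int]) -> int:
    """Derive a storage size from the counts (illustrative formula only)."""
    return sum(SLOT_BYTES[kind] * number for kind, number in counts.items())


example = {"id": 1, "owner": {"name": "a", "age": 2}, "tags": [{"k": "v"}, {"k": "w"}]}
example_counts = count_elements(example)
example_size = storage_size(example_counts)
```

Sizing the space from the counts up front lets a writer allocate once instead of growing a buffer while serializing.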
  • Patent number: 10853357
    Abstract: Systems and methods for querying a semantic data set are described. The semantic data set is defined by an ontology that represents a graphical relationship among data included in the semantic data set. One method includes receiving one or more keywords associated with a search operation from a user, and identifying a node associated with each of the one or more keywords. The method includes, for each identified node, tracing a path from the identified node to a root represented in the graphical relationship of the ontology, the path including one or more vertices, and identifying a lowest common ancestor of each of the vertices included in the paths for each identified node. The method includes constructing a subgraph connecting each identified node to the lowest common ancestor, and traversing the subgraph to generate a query in the query language executable against the semantic data set.
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: December 1, 2020
    Assignee: University of Southern California
    Inventors: Muhammad Rizwan Saeed, Charalampos Chelmis, Viktor K. Prasanna
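The path-tracing and lowest-common-ancestor steps described in the abstract above can be sketched as follows. This is a simplified illustration, assuming a toy ontology in which every node has a single parent; the `PARENT` table and all function names are hypothetical, and the final step of emitting a query in a concrete query language is omitted.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical toy ontology: child -> parent (the root maps to None).
PARENT: Dict[str, Optional[str]] = {
    "Thing": None,
    "Asset": "Thing",
    "Well": "Asset",
    "Pump": "Asset",
    "Sensor": "Thing",
}


def path_to_root(node: str) -> List[str]:
    """Return the vertices from node up to the root, node first."""
    path: List[str] = []
    current: Optional[str] = node
    while current is not None:
        path.append(current)
        current = PARENT[current]
    return path


def lowest_common_ancestor(nodes: List[str]) -> str:
    """Find the deepest vertex that lies on every node's path to the root."""
    paths = [path_to_root(n) for n in nodes]
    shared = set(paths[0]).intersection(*paths[1:])
    # Paths run from node to root, so the first shared vertex is the deepest.
    for vertex in paths[0]:
        if vertex in shared:
            return vertex
    raise ValueError("nodes do not share a root")


def connecting_subgraph(nodes: List[str]) -> Tuple[str, Set[Tuple[str, str]]]:
    """Return the LCA and the (child, parent) edges linking each node to it."""
    lca = lowest_common_ancestor(nodes)
    edges: Set[Tuple[str, str]] = set()
    for node in nodes:
        path = path_to_root(node)
        path = path[: path.index(lca) + 1]  # stop at the common ancestor
        edges.update(zip(path, path[1:]))
    return lca, edges
```

For the keywords mapped to `Well` and `Pump`, the subgraph is rooted at `Asset`; a query generator would then walk those edges to build its join or graph pattern.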
  • Patent number: 10853345
    Abstract: Automatic clustering of a database table is disclosed. A method for automatic clustering of a database table includes receiving an indication that a data modification task has been executed on a table and determining whether the table is sufficiently clustered. The method includes, in response to determining the table is not sufficiently clustered, selecting one or more micro-partitions of the table to be reclustered. The method includes assigning each of the one or more micro-partitions to an execution node to be reclustered.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: December 1, 2020
    Assignee: Snowflake Inc.
    Inventors: Jiaqi Yan, Thierry Cruanes, Jeffrey Rosen, William Waddington, Prasanna Rajaperumal, Abdul Munir
  • Patent number: 10838981
    Abstract: A method of modifying a database architecture to improve database performance is disclosed. A replication database table is generated from an original database table. The original database table is divided into a first plurality of partitions optimized for handling accesses for satisfying a first type of query. The replication database table is divided into a second plurality of partitions optimized for handling accesses for satisfying a second type of database query. Based on receiving the database query of the first type, the query of the first type is routed to one host of a plurality of hosts to satisfy the query, the one host hosting one of the first plurality of partitions; based on receiving the database query of the second type, the query is routed to the same or another host of the plurality of hosts to satisfy the query, the second host hosting one of the second plurality of partitions.
    Type: Grant
    Filed: August 30, 2017
    Date of Patent: November 17, 2020
    Assignee: SAP SE
    Inventor: Friedrich Keller
  • Patent number: 10834449
    Abstract: Methods and apparatus to associate audience members with over-the-top device media impressions. An example audience measurement apparatus includes an impression monitoring system to monitor and log media impressions based on impression requests received via network communications from over the top devices. A linkage database system is to implement the impression monitoring system. A processor in circuit with the impression monitoring system is to: generate Internet protocol (IP) address-to-cookie mappings; associate household identifiers of households with ones of the IP address-to-cookie mappings; and associate ones of the household identifiers to the logged media impressions based on IP addresses associated with the media impressions and based on the household identifiers associated with the ones of the IP address-to-cookie mappings.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: November 10, 2020
    Assignee: THE NIELSEN COMPANY (US), LLC
    Inventors: Steven J. Splaine, Harith Gandhi, Kevin K. Gaynor, Gangadhar Jakkula, Narasimha Reddy Goli, Chris Nicotra, Kumar Rao
  • Patent number: 10824949
    Abstract: A method of training a model, said model being adapted to map a first graphical data structure representation to a second graphical data structure representation, the first graphical data structure representation comprising nodes, with at least one of a plurality of relations between said nodes, the second graphical data structure representation comprising nodes, the mapping comprises using an attention mechanism, wherein said attention mechanism establishes the importance of specific nodes dependent on their neighbourhood and the relations between the nodes, wherein the mapping comprising using a projection kernel to map the nodes of the first graphical structure to nodes of an intermediate representation and using an attention kernel to enact the attention mechanism. The method includes receiving a training data set comprising an output layer and a corresponding input layer. The method also includes training the parameters of the projection kernel and the attention kernel using the training data set.
    Type: Grant
    Filed: April 4, 2019
    Date of Patent: November 3, 2020
    Assignee: Babylon Partners Limited
    Inventors: Daniel William Busbridge, Pietro Cavallo, Dane Grant Sherburn, Nils Yannick Hammerla
  • Patent number: 10824756
    Abstract: A hosted application gateway server node may be communicatively coupled to backend systems, client devices, and database shards associated with database servers. Through the gateway server node, various services may be provided to managed containers running on client devices such that enterprise applications can be centrally managed. A sharding manager may manage relationships of database items across database shards. Each shard stores a copy of a table representing a split of a relationship. A shard ID mask is included in each item's ID. At query time, the shard ID can be extracted and used to query the correct database. This query routing mechanism allows navigation from one shard to another when multiple items are in a relationship (e.g., share the same resource such as a document). As such, embodiments can eliminate the need for APIs to join in data that span multiple shards.
    Type: Grant
    Filed: April 1, 2019
    Date of Patent: November 3, 2020
    Assignee: OPEN TEXT SA ULC
    Inventors: Gregory Beckman, Robert Laird, Geoffrey Michael Obbard
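The idea of embedding a shard ID in each item's ID and extracting it at query time can be shown with a short sketch. The bit layout here (shard ID in the low 8 bits) is an assumption made for illustration, as are the function names; the abstract does not specify how the mask is laid out.

```python
from typing import Dict, TypeVar

T = TypeVar("T")

SHARD_BITS = 8                      # assumed width of the shard field
SHARD_MASK = (1 << SHARD_BITS) - 1  # 0xFF: selects the low 8 bits


def make_item_id(local_id: int, shard_id: int) -> int:
    """Pack a per-shard local ID and the shard ID into one item ID."""
    if not 0 <= shard_id <= SHARD_MASK:
        raise ValueError("shard_id does not fit in the shard field")
    return (local_id << SHARD_BITS) | shard_id


def shard_of(item_id: int) -> int:
    """Extract the shard ID from an item ID by applying the mask."""
    return item_id & SHARD_MASK


def route(item_id: int, shards: Dict[int, T]) -> T:
    """Pick the database handle that owns the item, without a lookup table."""
    return shards[shard_of(item_id)]


# Hypothetical shard handles; in practice these would be database connections.
shards = {0: "db-0", 3: "db-3"}
document_id = make_item_id(local_id=42, shard_id=3)
```

Because the owning shard can be read straight from the ID, following a relationship from an item on one shard to a related item on another needs no cross-shard join.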
  • Patent number: 10812617
    Abstract: A system for exchanging various forms of information between computer-executable agents. A computing device is configured to determine semantic data associated with each data object (DO) of a plurality of DOs. Each DO is associated with a location, and the semantic data describes the content of the associated DO. The computing device receives, from a first user computing device, a request for DO information and, in response to the request, provides DO information including the locations and the semantic data associated with the retrieved DOs to the user computing device by (a) transmitting the locations and the semantic data to the first user computing device, and/or (b) instructing the first user computing device to request the DO information from a second user computing device to which the locations and the semantic data were previously transmitted.
    Type: Grant
    Filed: March 19, 2018
    Date of Patent: October 20, 2020
    Assignee: Virtuanet LLC
    Inventors: Michael Sean Koonce, Soeb Ahmed
  • Patent number: 10810236
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage medium, for indexing data in information retrieval systems. In one aspect, a system includes computer readable storage media, where the non-transitory computer readable storage media include postings lists for a term, where each posting list includes postings blocks, position blocks, and metadata blocks. Each postings block includes document identifiers and respective indications of frequency of the term in each document. Each position block includes position indications for the term in the documents of a corresponding postings block. Each metadata block includes an indication of the size of the document identifiers in the associated postings block, an indication of the size of the indications of frequency of terms in the associated postings block, an indication of a number of postings in the associated postings block, and an indication of position information for postings in the associated postings block.
    Type: Grant
    Filed: October 23, 2017
    Date of Patent: October 20, 2020
    Assignee: Twitter, Inc.
    Inventors: Yan Zhao, Paul Burstein, Yi Zhuang, Michael Busch
  • Patent number: 10810247
    Abstract: A method for indexing objects in a computerized system having an index, comprising identifying in the computerized system an at least one indexed object that meets an at least one criterion related to contents of the at least one indexed object, detecting an at least one non-indexed object having a property similar to an at least one property of the at least one indexed object that was identified, and indexing the at least one non-indexed object in the index, wherein the method is performed by the computerized system, and an apparatus for performing the same.
    Type: Grant
    Filed: November 8, 2017
    Date of Patent: October 20, 2020
    Assignee: VARONIS SYSTEMS, LTD.
    Inventors: Yakov Faitelson, Ohad Korkus, Ophir Kretzer-Katzir
  • Patent number: 10810198
    Abstract: Techniques related to group determination based on multi-table dictionary codes are disclosed. In some embodiments, one or more non-transitory storage media store a sequence of instructions which, when executed by one or more computing devices, cause performance of a method. The method comprises storing a fact table and a dimension table that share a domain dictionary. The fact table and the dimension table each have a column of encoded join keys that is decodable using the shared domain dictionary. A query may specify one or more row groups for the dimension table. To efficiently process the query, one or more group identifiers are assigned to the one or more row groups. Each row group corresponds to a different group identifier. This enables a code-to-group-identifier mapping to be generated. The code-to-group-identifier mapping correlates the encoded join keys to the one or more group identifiers.
    Type: Grant
    Filed: March 12, 2018
    Date of Patent: October 20, 2020
    Assignee: Oracle International Corporation
    Inventors: Albert Hopeman, IV, Ekrem S. C. Soylemez, Martin Roth
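A code-to-group-identifier mapping of the kind described in the abstract above might look like the following sketch. The tables, the city and region values, and the function names are all invented for illustration; the point is only that the fact table's encoded join keys can be grouped without decoding them through the shared dictionary.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical shared domain dictionary: the code is the list index.
DICTIONARY = ["SF", "LA", "NYC", "BOS"]

# Dimension rows: (encoded join key, grouping attribute named in the query).
dimension: List[Tuple[int, str]] = [(0, "West"), (1, "West"), (2, "East"), (3, "East")]

# Fact rows: (encoded join key, measure).
fact: List[Tuple[int, float]] = [(0, 10.0), (2, 5.0), (1, 7.5), (2, 1.0), (3, 2.5)]


def build_code_to_group(rows: List[Tuple[int, str]]) -> Tuple[Dict[str, int], Dict[int, int]]:
    """Assign one group identifier per row group and map each code to it."""
    group_ids: Dict[str, int] = {}
    code_to_group: Dict[int, int] = {}
    for code, group_value in rows:
        group_id = group_ids.setdefault(group_value, len(group_ids))
        code_to_group[code] = group_id
    return group_ids, code_to_group


def aggregate(fact_rows: List[Tuple[int, float]], code_to_group: Dict[int, int]) -> Dict[int, float]:
    """Sum the measure per group identifier using only the encoded keys."""
    totals: Dict[int, float] = defaultdict(float)
    for code, amount in fact_rows:
        group_id = code_to_group.get(code)
        if group_id is not None:  # codes outside every row group are skipped
            totals[group_id] += amount
    return dict(totals)


group_ids, code_to_group = build_code_to_group(dimension)
totals = aggregate(fact, code_to_group)
```

Since both tables share one dictionary, the mapping is built once from the smaller dimension table and then applied directly to the fact table's codes.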
  • Patent number: 10803093
    Abstract: Techniques for labeling data files that enable a file management label to persist on a data file. Variations disclosed herein enable a file management label to be embedded within a data file so that the file management label propagates with the data file when the data file is copied and/or moved between systems and/or workloads. In some embodiments, an application may be configured to write the file management label directly into a stream of the data file. For example, an application client that is run locally on a client device may open the data file by accessing its corresponding stream. While the data file is open within the application client such that a user may edit content of the data file, the application client may be caused to associate the file management label with the data file. Ultimately, the application client may write this association directly into the stream.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: October 13, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Kannan Dhanasekaran, Nakul Garg, Krishna Kumar Parthasarathy, Dheepak Ramaswamy, Yu Li, Martin Gagne, Yonghua Yang, Jinhao Li, Liming Chen, Mingquan Xue
  • Patent number: 10803115
    Abstract: An approach is provided to provide URLs based on a digital subject matter image (DSMI) received from a requestor. The approach identifies a set of classification labels pertaining to the DSMI and matches the set of classification labels against multiple sets of predefined labels with each of the sets of predefined labels corresponding to a URL. The matching results in a selected URL pertaining to the DSMI with this URL being returned to the requestor. In one embodiment, the approach is performed by a domain name system (DNS).
    Type: Grant
    Filed: July 30, 2018
    Date of Patent: October 13, 2020
    Assignee: International Business Machines Corporation
    Inventors: Chih-Hsiung Liu, Cheng-Fang Lin, I-Chien Lin, Chiwen Chang, Ci-Wei Lan, Tsai-Hsuan Hsieh, Kate Lin, Peter Wu, Kuo-Liang Chou
  • Patent number: 10803034
    Abstract: Methods, systems, and computer-readable media for global column indexing in a graph database are disclosed. A plurality of data elements of a graph database are stored. The triples comprise identifiers, column names, and values. The column names are globally scoped in the graph database and are associated with data types. Indices corresponding to the column names are created. A particular one of the indices comprises one or more of the values associated with the corresponding column name. A query is performed on the graph database using one or more of the indices corresponding to one or more of the column names associated with the query.
    Type: Grant
    Filed: November 23, 2016
    Date of Patent: October 13, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Kawarjit Bedi, Piyush Gupta, Xingbo Wang, Sainath Chowdary Mallidi, Andi Gutmans
  • Patent number: 10795920
    Abstract: An information processing device (100) processes a data structure that represents a set of points included in a multidimensional space, and includes: a search unit (10) that, when a particular multidimensional region is specified as a query region, specifies regions that are expressed as portions shared between regions starting with feature value prefixes and regions starting with coordinate prefixes with respect to respective dimensions, and that are included in the query region, and outputs selected sections that are sections included in a data structure corresponding to the specified regions, the feature value prefixes being prefixes obtained from bit representations of feature values of the points, and the coordinate prefixes being prefixes obtained from bit representations of coordinates of the points with respect to a particular dimension; and an aggregation unit (20) that calculates information regarding a set of points included in the query region, by using the selected sections.
    Type: Grant
    Filed: September 10, 2015
    Date of Patent: October 6, 2020
    Assignee: NEC SOLUTION INNOVATORS, LTD.
    Inventors: Yuzuru Okajima, Kouichi Maruyama
  • Patent number: 10785535
    Abstract: In one embodiment, a method defines a plurality of geographic primitives. A geographic primitive is defined by one or more location identifiers. A plurality of distribution areas are generated where a distribution area is defined by one or more of the geographic primitives. The method assigns one of the distribution areas to each of a plurality of pieces of media content. Then, the method stores the assigned one of the plurality of distribution areas for each of the plurality of pieces of media content in a database. One or more of the plurality of pieces of media content are published with the assigned one of the distribution areas. The assigned one of the distribution areas is usable to determine which media content in the one or more of the plurality of pieces of media content is available to users.
    Type: Grant
    Filed: May 29, 2018
    Date of Patent: September 22, 2020
    Assignee: HULU, LLC
    Inventors: Dallas Mahrt, Brandon Lonac, Hank Hwang, Stephen Brown, William Daly, Yingan Wang
  • Patent number: 10779038
    Abstract: In one embodiment, a method defines a plurality of geographic primitives that are defined by one or more location identifiers. A plurality of distribution areas are generated where a distribution area is defined by one or more of the geographic primitives. The method assigns one of the plurality of distribution areas to each of a plurality of pieces of media content. A request is received from a client device that is associated with a location identifier. The method performs a reverse lookup query to a database using a geo-mapping service to retrieve a set of distribution areas that include a geographical primitive associated with the location identifier and selects one or more pieces of media content from the plurality of pieces of media content. Information for at least a portion of the one or more pieces of media content is then provided to the client device.
    Type: Grant
    Filed: June 19, 2017
    Date of Patent: September 15, 2020
    Assignee: HULU, LLC
    Inventors: Dallas Mahrt, Brandon Lonac, Hank Hwang, Stephen Brown, William Daly, Yingan Wang
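The reverse lookup from a client's location identifier to distribution areas, and from there to media content, can be sketched as below. The ZIP codes, primitive names, area names, and titles are hypothetical sample data, and an in-memory dictionary stands in for the database and geo-mapping service mentioned in the abstract.

```python
from collections import defaultdict
from typing import Dict, List, Set

# Geographic primitives, each defined by location identifiers (here: ZIP codes).
PRIMITIVES: Dict[str, Set[str]] = {
    "dma_sf": {"94103", "94107"},
    "dma_la": {"90001"},
}

# Distribution areas, each defined by one or more primitives.
AREAS: Dict[str, Set[str]] = {
    "west_coast": {"dma_sf", "dma_la"},
    "bay_only": {"dma_sf"},
    "la_only": {"dma_la"},
}

# Each piece of media content is assigned exactly one distribution area.
CONTENT_AREA: Dict[str, str] = {
    "local_news_sf": "bay_only",
    "regional_game": "west_coast",
    "la_traffic": "la_only",
}


def build_reverse_index() -> Dict[str, Set[str]]:
    """Map each location identifier to the distribution areas that cover it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for area, primitive_names in AREAS.items():
        for name in primitive_names:
            for location in PRIMITIVES[name]:
                index[location].add(area)
    return index


REVERSE_INDEX = build_reverse_index()


def available_content(location_id: str) -> List[str]:
    """Return the titles whose assigned area covers the client's location."""
    areas = REVERSE_INDEX.get(location_id, set())
    return sorted(title for title, area in CONTENT_AREA.items() if area in areas)
```

Precomputing the location-to-area index keeps the per-request work to one lookup plus a filter over the content assignments.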