Of Unstructured Textual Data (epo) Patents (Class 707/E17.058)
  • Patent number: 11977569
    Abstract: Disclosed is a natural language processing pipeline that analyzes and processes a corpus of textual data to automatically create a knowledge graph containing the corpus entities such as subjects and object and their relationships such as predicates or verbs. The pipeline is configured as an end-to-end neural Open Schema Construction pipeline having a coreference resolution module, an open information extraction (OIE) module, and an entity canonicalization module. The processed textual data is input to a graph database to create the knowledge graph displayable through a graphical user interface. In operation, the pipeline modules serve to create a single term for all entity mentions in the corpus that reference the same entity through coreference resolution, extract all subject-predicate-object triplets from the coreference resolved corpus through OIE, and then canonicalize the corpus by clustering each entity mention to a canonical form for mapping to the knowledge graph and display.
    Type: Grant
    Filed: January 28, 2022
    Date of Patent: May 7, 2024
    Assignee: The United States of America, Represented by the Secretary of the Navy
    Inventors: Michael Lynn Potter, Natalie Lynn Larson, Amy Cheng, Hovanes Keseyan, Hanh Servin
  • Patent number: 11914629
    Abstract: The present invention relates to information retrieval. In order to facilitate a search and identification of documents, there is provided a computer-implemented method for training a classifier model for data classification in response to a search query.
    Type: Grant
    Filed: August 5, 2021
    Date of Patent: February 27, 2024
    Assignee: BASF SE
    Inventors: Arunav Mishra, Henning Schwabe, Lalita Shaki Uribe Ordonez
  • Patent number: 11914961
    Abstract: Systems, devices, and methods of the present invention involve discourse trees. In an example, a method involves generating a discourse tree. The method includes identifying, from the discourse tree, a central entity that is associated with a rhetorical relation of type elaboration and corresponds to a topic node that identifies a central entity of the text. The method includes determining a subset of elementary discourse units of the discourse tree that are associated with the central entity. The method includes forming generalized phrases from the subset of elementary discourse units. The method includes forming tuples from the generalized phrases, where a tuple is an ordered set of words in normal form. The method involves responsive to successfully converting an elementary discourse unit associated with an identified tuple into a logical representation, updating the ontology with an entity from the identified tuple.
    Type: Grant
    Filed: September 3, 2021
    Date of Patent: February 27, 2024
    Assignee: Oracle International Corporation
    Inventor: Boris Galitsky
  • Patent number: 11900181
    Abstract: A data object from a data source is received by a distributed process in a data stream. The distributed process has a sequence of categories, each category containing one or more tasks that operate on the data object. The data object includes files that can be processed by the tasks. If the task is able to operate on the data object, then the data object is passed to the task. If the task is unable to operate on the data object, then the files in the data object are passed to a file staging area of the distributed process and stored in memory. The files in the file staging area are passed, in sequence, from the file staging area to the task that was unable to operate on the data object. The data object is outputted to a next category or data sink after being operated on by the task.
    Type: Grant
    Filed: April 28, 2021
    Date of Patent: February 13, 2024
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Shalini Raghavan, Tom J. Traughber, George Vanecek, Jr.
  • Patent number: 11900177
    Abstract: In an example embodiment, a graphical user interface-based software tool is provided that uses integrated process information and information of a technical infrastructure to provide automatically-analyze integrations. The tool displays a list of integrations available with one or more corresponding software products linked to the first process by the linkage, each integration in the list of integrations identifying a separate software product with a separate application program interface (API) from the first software product.
    Type: Grant
    Filed: December 6, 2021
    Date of Patent: February 13, 2024
    Assignee: SAP SE
    Inventor: Daniel Oberle
  • Patent number: 11886478
    Abstract: One or more computing devices, systems, and/or methods are provided. In an example, a first performance metric score may be determined based upon first content item text. A plurality of similarity scores associated with a plurality of sets of content item text may be determined. One or more sets of content item text may be selected from among the plurality of sets of content item text based upon the plurality of similarity scores and a plurality of performance metric scores associated with the plurality of sets of content item text. The plurality of performance metric scores may comprise one or more performance metric scores associated with the one or more sets of content item text. The one or more performance metric scores may be higher than the first performance metric score. One or more representations of the one or more sets of content item text may be displayed.
    Type: Grant
    Filed: May 7, 2021
    Date of Patent: January 30, 2024
    Assignee: Yahoo Assets LLC
    Inventors: Shaunak Mishra, Changwei Hu, Kevin Yen, Manisha Verma, Yifan Hu, Maxim Ivanovich Sviridenko, Avinash Chukka, Max Edward Beech, Chao-Hung Wang, Hua-Ying Tsai, Kamil Michal Zasadzinski, Wei Yu Lin, Yu Tian
  • Patent number: 11869131
    Abstract: A system and method for determining an absolute position of an object in an area is presented. The system includes a server having a processor, and a plurality of camera nodes coupled to the server. Each node includes a camera that acquires images of the object and area. The server receives image data from a camera, detects the object within an approximate location by image analysis techniques, and determines a relative position of the object in pixel coordinates. The processor then detects stationary markers proximate to the relative location of the object, determines an absolute position of the detected markers relative to known markers to define an absolute position of the marker, and determines an absolute location of the object in relation to the absolute location of the detected marker. This absolute position of the object is provided to an official to accurately locate the object in the area.
    Type: Grant
    Filed: August 30, 2022
    Date of Patent: January 9, 2024
    Assignee: PRECISION POINT SYSTEMS, LLC
    Inventors: Daniel Kohler, Terence Sauer, David Schroeder, Dennis Wanzie
  • Patent number: 11847499
    Abstract: Systems and methods for coordinating components can include: determining, by a first application executing on a client device, a need to perform a sharable functional task; identifying a first software component installed on the client device and capable of performing a first variation of the sharable functional task; identifying a second software component installed on the client device and capable of performing a second variation of the sharable functional task, wherein the second variation of the sharable functional task is functionally overlapping with and not identical to the first variation; identifying a set of characteristics of both the first software component and the second software component; selecting the second software component for performing the sharable functional task based on the set of characteristics, where the set of characteristics includes at least a version number; and delegating performance of the sharable functional task to the second software component.
    Type: Grant
    Filed: December 15, 2021
    Date of Patent: December 19, 2023
    Assignee: LOOKOUT INC.
    Inventors: Matthew John Joseph LaMantia, Brian James Buck, Stephen J. Edwards, William Neil Robinson
  • Patent number: 11797275
    Abstract: System and methods of ontology model development are disclosed. An example system and method may comprise, defining, an ontological model, generating, an ontology library based on the ontological model, deploying, the ontology library to an IoT system, generating, an ontology instance based on the ontology library deployed to the IoT system, modifying, an IoT application based on the ontology instance, and managing the IoT system utilizing the IoT application.
    Type: Grant
    Filed: July 9, 2018
    Date of Patent: October 24, 2023
    Assignee: SCHNEIDER ELECTRIC USA, INC.
    Inventors: Charbel Joseph El Kaed, André Ponnouradjane
  • Patent number: 11789949
    Abstract: A partition key format for allocating partitions to data items in a single table database, where the data items are owned by different entities. The partition key format including a sequence of a plurality of frames, wherein a first of said frames is an identifier of the requesting entity (EID), and a second one of said frames is an identifier of the type of data item (TID).
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: October 17, 2023
    Assignee: Command Alkon Incorporated
    Inventors: Douglas A. Moore, Todd McPartlin
  • Patent number: 11775596
    Abstract: Some embodiments provide a method for defining a content relevance model for determining whether a content segment is relevant to a particular category. The method receives a first set of content segments that contain content relevant to the particular category and a second set of content segments that contain content not relevant to the particular category. The method identifies a set of key word sets more likely to appear in the first set of content segments than the second set of content segments. The method defines a content relevance model that comprises a set of groups of word sets and a score for each group, each of the groups of word sets comprising a key word set from the set of key word sets and at least one word set found in a context of the key word set in at least one of the received content segments.
    Type: Grant
    Filed: April 21, 2022
    Date of Patent: October 3, 2023
    Assignee: Aurea Software, Inc.
    Inventors: Ashutosh Joshi, Martin Betz, Rajiv Arora, Rakesh Kumar Srivastava, David Cooke
  • Patent number: 11776291
    Abstract: Systems and methods for generation and use of document analysis architectures are disclosed. A model builder component may be utilized to receiving user input data for labeling a set of documents as in class or out of class. That user input data may be utilized to train one or more classification models, which may then be utilized to predict classification of other documents. Trained models may be incorporated into a model taxonomy for searching and use by other users for document analysis purposes.
    Type: Grant
    Filed: June 10, 2020
    Date of Patent: October 3, 2023
    Assignee: AON RISK SERVICES, INC. OF MARYLAND
    Inventors: Samuel Cameron Fleming, David Craig Andrews, John E. Bradley, III, Lewis C. Lee, Jared Dirk Sol, Timothy Seegan, Scott Buzan
  • Patent number: 11762893
    Abstract: Creating a summary of a plurality of texts includes tokenizing each of a plurality of texts to obtain tokens; generating a vector space using a first set of vectors having one or more obtained feature scores equal to or larger than a predefined value; executing non-hierarchical clustering using the vector space to generate a first plurality of clusters; choosing a first representative text in each of the plurality of clusters; generating a second set of vectors from each of the arrays generated based on a number of characters included in tokens of the representative texts; executing hierarchical clustering using the second set of vectors to generate a second plurality of clusters; and in response to a determining a number of clusters included in the second plurality of clusters, determining a second representative text for each of the clusters included in the second plurality of clusters.
    Type: Grant
    Filed: December 14, 2021
    Date of Patent: September 19, 2023
    Assignee: International Business Machines Corporation
    Inventors: Yu Gu, Takayuki Kushida, Hiroki Nakano, Yaoping Ruan, Yuji Sugiyama
  • Patent number: 11755843
    Abstract: Systems and techniques that facilitate spurious relationship filtration from external knowledge graphs based on distributional semantics of an input corpus are provided. In one or more embodiments, a context component can generate a context-based word embedding of one or more first terms in a document collection. The embedding can yield vector representations of the one or more first terms. The one or more first terms can correspond to knowledge terms in one or more first nodes of a knowledge graph. In one or more embodiments, a filtering component can filter out a relationship between the one or more first nodes and a second node of the knowledge graph based on a similarity value being less than a threshold. The similarity value can be a function of the vector representations of the one or more first terms. In various embodiments, cosine similarity can be used to compute the similarity value.
    Type: Grant
    Filed: May 18, 2021
    Date of Patent: September 12, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Nandana Mihindukulasooriya, Robert G. Farrell, Nicolas Rodolfo Fauceglia, Alfio Massimiliano Gliozzo
  • Patent number: 11722507
    Abstract: The disclosed embodiments relate to a system that generates an alert based on information extracted from search results generated by a query. During operation, the system executes the query to generate the search results. The system also obtains configuration information for the alert, wherein the configuration information identifies information associated with the search results, and also specifies a trigger condition for the alert. Next, when the trigger condition for the alert is met, the system uses the configuration information to generate a payload containing the identified information associated with the search results. The system then invokes alert-generating functionality and provides the payload as input to the alert-generating functionality. This enables the alert-generating functionality to use the information from the search results while performing one or more alert actions association with the alert.
    Type: Grant
    Filed: March 18, 2022
    Date of Patent: August 8, 2023
    Assignee: Splunk Inc.
    Inventors: Nicholas J. Filippi, Siegfried Puchbauer-Schnabel, Carl S. Yestrau, Vivian Shen, J. Mathew Elting
  • Patent number: 11710496
    Abstract: A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.
    Type: Grant
    Filed: July 1, 2019
    Date of Patent: July 25, 2023
    Assignee: Google LLC
    Inventors: Aaron Donsbach, Dirk Padfield
  • Patent number: 11699447
    Abstract: Systems and methods are provided herein for determining one or more traits of a speaker based on voice analysis to present content item to the speaker. In one example, the method receives a voice query and determines whether the voice query matches within a first confidence threshold of a speaker identification (ID) among a plurality of speaker IDs stored in a speaker profile. In response to determining that the voice query matches to the speaker ID within the first confidence threshold, the method bypasses a trait prediction engine and retrieves a trait among the plurality of traits in the speaker profile associated with the matched speaker ID. The method further provides a content item based on the retrieved trait.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: July 11, 2023
    Assignee: ROVI GUIDES, INC.
    Inventors: Ankur Anil Aher, Jeffry Copps Robert Jose
  • Patent number: 11651152
    Abstract: A method may include obtaining a document and using a first prediction model to generate text block scores for text blocks in the document, where a first text block of the text blocks is associated with a first text block score of the plurality of text block scores. The method also includes updating, in response to the first text block score for the first text block failing to satisfy a criterion, a modified version of the document with an indicator to set the first text block as a hidden text block in a presentation of the modified version. The method also includes generating a summarization of the first text block based on the words in the first text block and updating the modified version of the document to include the summarization. The method also includes providing the modified version of the document to a user device.
    Type: Grant
    Filed: June 10, 2021
    Date of Patent: May 16, 2023
    Assignee: Capital One Services, LLC
    Inventors: Austin Walters, Anh Truong, Jeremy Goodsitt, Vincent Pham, Galen Rafferty, Reza Farivar
  • Patent number: 11636530
    Abstract: Aspects of the disclosure relate to content prediction. A computing platform may train a collaborative recommendation engine to output recommendation information based on historical preference information and corresponding data drift. The computing platform may receive an account access request from a user device. The computing platform may identify, at a first time and using the collaborative recommendation engine, a preference group for the user device. The computing platform may receive, at a second time later than the first time, a second account access request from the user device. The computing platform may identify, using the collaborative recommendation engine, the preference group and data drift corresponding to the preference group between the first time and the second time, which may indicate a second set of preferences at the second time. The computing platform may generate, based on the second set of preferences, recommendation information for the user device.
    Type: Grant
    Filed: June 28, 2021
    Date of Patent: April 25, 2023
    Assignee: Bank of America Corporation
    Inventor: Maharaj Mukherjee
  • Patent number: 11550810
    Abstract: Methods and systems for providing computer-assisted guided review of unstructured data to generate a structured data output based on customizable template rules. In embodiments, an unstructured file is received, and a predefined template is selected. The predefined template includes a plurality of fields, each field corresponding to a field of the structured report. The predefined template also defines extraction rules for each field of the predefined template, and the extraction rules define parameters for identifying unstructured data relevant to the associated field. The extraction rules are applied to the unstructured file to identify data relevant to the field associated with the corresponding extraction rule, and the data identified as relevant is confirmed. Confirming the relevant data includes determining to refine the relevant data based on a condition, and modifying the extraction rule associated with the field to refine the relevant data.
    Type: Grant
    Filed: February 6, 2019
    Date of Patent: January 10, 2023
    Assignee: Thomson Reuters Enterprise Centre GmbH
    Inventors: Hella-Franziska Hoffmann, Johannes Schleith
  • Patent number: 11544467
    Abstract: The present disclosure relates to processing operations configured to provide a linguistic-based approach to evaluating repetition in content of an electronic document. The approach of the present disclosure is about detecting terms/words/phrases that are likely to be perceived as being repetitious by native speakers of a language rather than just identifying the occurrence of identical words or strings in a document as done by traditional language checks. Processing of the present disclosure detects and evaluates terms or phrases using positive linguistic evidence derived from evaluation of linguistic relationships between words in a string in syntactic ways. This results in more accurate and efficient determination as to whether a term is truly repetitious at the linguistic level as compared with traditional language checks.
    Type: Grant
    Filed: June 15, 2020
    Date of Patent: January 3, 2023
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Davide Turcato, Alfredo R. Arnaiz, Domenic Joseph Cipollone, Michael Wilson Daniels
  • Patent number: 11522911
    Abstract: Methods, systems, and computer program products for performing passive and active identity verification in association with online communications. For example, a computer-implemented method may include receiving one or more electronic messages associated with a user account, analyzing the electronic messages based on a plurality of identity verification profiles associated with the user account, generating an identity trust score associated with the electronic messages based on the analyzing, determining whether to issue a security challenge in response to the electronic messages based on the generated identity trust score, and issuing the security challenge in response to the electronic messages based on the determining.
    Type: Grant
    Filed: April 19, 2021
    Date of Patent: December 6, 2022
    Assignee: PAYPAL, INC.
    Inventors: Bradley Wardman, Jakub Ceiran Burgis, Nicole Harris, Blake Butler, Nathan Robert Pratt, Kevin James Tyers
  • Patent number: 11521091
    Abstract: A computer-implemented method, a computer program product, and a computer system for enhanced distributed machine learning. A fusion server in a distributed machine learning system determines correlation relationships across agents in the distributed machine learning system, based on auxiliary information. The fusion server clusters the agents to form one or more communities, based on the correlation relationships. The fusion server selects, from the one or more communities, participating agents that participate in the enhanced distributed machine learning.
    Type: Grant
    Filed: May 22, 2019
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Changchang Liu, Shiqiang Wang, Wei-Han Lee, Seraphin Bernard Calo
  • Patent number: 11481389
    Abstract: Methods, systems, and computer program products for generating an executable code based on a document are disclosed. Rules are identified in a document, the identified rules are translated into encoded rules, and an executable code is generated from the encoded rules. Identification of rules includes splitting a text of the document into a plurality of sentences; and for each sentence of the plurality of sentences, determining whether the sentence corresponds to a rule. Translation of an identified rule into an encoded rule includes extracting, from the identified rule, elements corresponding to predefined categories; determining one or more relationships between the extracted elements; and translating the one or more determined relationships into a structured expression. Generating the executable code from the encoded rules includes translating the structured expression associated with the identified rule into a programming language query.
    Type: Grant
    Filed: June 20, 2018
    Date of Patent: October 25, 2022
    Inventors: Youness Mansar, Sira Ferradans
  • Patent number: 11461343
    Abstract: Embodiments of the systems and methods disclosed herein provide a prescriptive analytics platform and polarity analysis engine in which a user can identify a target objective and use the system to find out whether the user's objectives are being met, what predictive factors are positively or negatively affecting the targeted objectives, as well as what recommended changes the user can make to better meet the objectives. The systems and methods may include a polarity analysis engine configured to determine the polarity of terms in free-text input in view of the target objective and the predictive factors and use the polarity to generate the recommended changes.
    Type: Grant
    Filed: February 4, 2021
    Date of Patent: October 4, 2022
    Assignee: Clearsense Acquisition 1, LLC
    Inventors: Adrian Marc Bir, Nikolai Nikolaevich Liachenko, Daniel Brooks Presley
  • Patent number: 11449496
    Abstract: An example embodiment may involve a software application executable on computing devices of a remote network management platform and a computation instance associated with a managed network. The computational instance may contain a database storing data of the managed network. The software application may receive, from a client device of the managed network, a natural language query (NLQ), and retrieve Backus-Naur form (BNF) rules and a set of metadata associated with the BNF rules. The metadata may include a text-based description of a schema of the database and abbreviations associated with the BNF rules. The NLQ may be parsed using the BNF rules together with the metadata by applying the metadata during parsing to extend the BNF rules. A query object based on the parsed query may be generated, and the database searched using the query object. A result of the search may be transmitted to the client device.
    Type: Grant
    Filed: October 25, 2019
    Date of Patent: September 20, 2022
    Assignee: ServiceNow, Inc.
    Inventors: Mikhail Rumiantsau, Aliaksei Vertsel
  • Patent number: 11392675
    Abstract: Methods, systems, and computer-readable media for request authorization using service coordination are disclosed. An authorization data structure and an operation data structure are selected based at least in part on a request for an operation. The authorization data structure comprises a directed acyclic graph representing a flow of data between service operations associated with authorization of the operation, and the operation data structure comprises a directed acyclic graph representing a flow of data between a service operations associated with execution of the operation. Authorization of the operation is attempted using the authorization data structure, comprising invoking one or more of the service operations associated with authorization. If the authorization is successful, then the execution of the operation is initiated using the operation data structure, comprising invoking one or more of the service operations associated with execution.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: July 19, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Robin Alan Golden, Marc Andrew Bowes, Izak Van Der Merwe
  • Patent number: 11341146
    Abstract: Data may be queried and analyzed in order to draw insights. One type of data query that may be performed is a funnel query. A funnel query is a query characterized by a sequence of events, e.g.: “In the last N days, how many unique users performed event A, then event B, and then event C”. Systems and methods for performing funnel queries are provided herein. In some embodiments, the speed at which a computer can answer a funnel query may be increased. In some embodiments, a bitmap is used to eliminate one or more sequences of events that would otherwise need to be traversed during the funnel query. In some embodiments, a sequence of events is stored across multiple data partitions, each data partition covering a different period of time.
    Type: Grant
    Filed: June 21, 2019
    Date of Patent: May 24, 2022
    Assignee: SHOPIFY INC.
    Inventors: Mikhal Arkhangorodsky, Mohammad Zeeshan Qureshi
  • Patent number: 11308096
    Abstract: Systems and methods for debiasing a recommendation engine are disclosed herein. A search query associated with a user profile is received at a recommendation engine. Control circuitry generates a result set of items of content based on the search query and generates a bias score for a content attribute based on the result set. The control circuitry also generates a time-averaged bias score for the content attribute based on a plurality of search queries associated with the user profile. Based on the bias score and the time-averaged bias score, the control circuitry determines whether a bias is signaled for the content attribute. Finally, the control circuitry outputs, for display via a computing device, the result set or a debiased result set based on a result of the determination of whether the bias is signaled.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: April 19, 2022
    Assignee: Rovi Guides, Inc.
    Inventors: Rajendran Pichaimurthy, Madhusudhan Srinivasan
  • Patent number: 10592565
    Abstract: An objective of the present invention is providing a method and apparatus for providing recommended information. A method according to the present invention comprises steps of: determining, based on one or more pieces of content information in one or more webpages, whether the one or more pieces of content information may be used as recommended information, respectively; obtaining feature information of the recommended information if the content information is recommended information; determining ordering information of the each piece of recommended information based on the feature information of each piece of recommended information; wherein the method further comprises the following step: if a user's browsing operation on the webpage corresponds to at least one piece of recommended information, presenting the at least one piece of recommended information.
    Type: Grant
    Filed: December 31, 2014
    Date of Patent: March 17, 2020
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Chi Tang, Huadong Li, Weiyu Chen, Jiajia Chen, Xunliang Cai, Yang Song
  • Patent number: 10579663
    Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.
    Type: Grant
    Filed: May 2, 2017
    Date of Patent: March 3, 2020
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu
  • Patent number: 10394875
    Abstract: A document relationship analysis system. Aspects of the system include ingesting, discovering, recommending, analyzing, and exporting documents of interest. The system dynamically searches large or streaming datasets using a tiered, multi-step approach that includes discovery techniques and recommender components to filter and refine these larger datasets to smaller datasets of documents of interest. The system dynamically selects and renders an appropriate visualization for result datasets based on predetermined measures that allow for facilitate analysis of the documents of interest.
    Type: Grant
    Filed: February 16, 2018
    Date of Patent: August 27, 2019
    Assignee: VortexT Analytics, Inc.
    Inventors: Matthew Cody Lambert, Peter Joseph Angerani, Gregory David Ostermayr
  • Patent number: 10255283
    Abstract: A mechanism for progressive topic modeling is disclosed to facilitate document content analysis. Input documents can be sorted and divided into multiple groups. Topic modeling is performed for each group, where the topic modeling for one group is based on the generated topic model from a previous group, if available. The vocabulary used in the topic modeling process can also be updated for each group of documents. The generated topics can be presented in a user interface to facilitate a user in analyzing the documents. The topic modeling mechanism can also be utilized to enhance a document search experience by generating topics from documents contained in search results and presenting topic words to a user as suggested search terms.
    Type: Grant
    Filed: September 19, 2016
    Date of Patent: April 9, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Weiwei Cheng, Christopher Gonzales
  • Patent number: 9928295
    Abstract: A document relationship analysis system. Aspects of the system include ingesting, discovering, recommending, analyzing, and exporting documents of interest. The system dynamically searches large or streaming datasets using a tiered, multi-step approach that includes discovery techniques and recommender components to filter and refine these larger datasets to smaller datasets of documents of interest. The system dynamically selects and renders an appropriate visualization for result datasets based on predetermined measures that allow for facilitate analysis of the documents of interest.
    Type: Grant
    Filed: February 2, 2015
    Date of Patent: March 27, 2018
    Assignee: Vortext Analytics, Inc.
    Inventors: Matthew Cody Lambert, Peter Joseph Angerani, Gregory David Ostermayr
  • Patent number: 9773049
    Abstract: A method includes receiving a first request from a first user device for first data, where the first request identifies a first data source and sending a first data access request to the first data source, where the first data access request is based on a first reader object associated with the first data source. The method also includes receiving the first data from the first data source, where the first data has a first format, and transforming the first data to normalized data in a normalized format based on the first reader object. The method further includes selecting a first presentation object from a database comprising a plurality of presentation objects based on a first device type of the first user device and transforming the normalized data to output data in an output format based on the first presentation object.
    Type: Grant
    Filed: June 16, 2015
    Date of Patent: September 26, 2017
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Bryan Davis, Dan Musgrove, Michael Raftelis
  • Patent number: 9753921
    Abstract: A content management system including a document management system provides documents that include comments entered by users. Comments are organized into threads; each thread is associated with a span of text in the document. When a user requests access to a document, the document management system determines which threads are visible to the user based on an audience associated with each thread. the audience comprises the user identifiers of i) the author of the document containing the thread; ii) the authors of comments included in the thread; iii) the authors of any text included in the text span for the thread; iv) any user mentioned in the text span the thread via a user primitive; v) any user mentioned in a comment via user primitive.
    Type: Grant
    Filed: April 29, 2015
    Date of Patent: September 5, 2017
    Assignee: DROPBOX, INC.
    Inventors: Anthony DeVincenzi, Matthew Blackshaw, Balabhadra Graveley, Igor Kofman
  • Patent number: 9471644
    Abstract: A computer-implemented method, computer-readable medium and system for scoring a text are disclosed. Themes within one or more texts may be determined and used to score each text, where an overall score for each text may indicate a respective importance and/or value of each text. The score for each text may be determined based upon a number of themes, type of themes, frequency of theme elements associated with the themes, distribution of theme elements associated with the themes, location of themes in the text, some combination thereof, etc. In this manner, the importance or value of one or more texts may be determined more accurately using information within each text with reduced reliance upon external information. Additionally, more relevant search results can be returned to a user by using internal information to perform ranking operations and/or filtering operations associated with a search.
    Type: Grant
    Filed: December 29, 2014
    Date of Patent: October 18, 2016
    Assignee: LEXXE PTY LTD
    Inventor: Hong Liang Qiao
  • Patent number: 9430464
    Abstract: A method, system and computer-usable medium are disclosed for identifying unchecked criteria in unstructured and semi-structured data within a form. Text spans representing unchecked criteria within unstructured text in a form are detected and classified to facilitate accurate interpretation of the text. Section identification and annotation operations are then performed to identify and categorize sections within the form. Checklist sections within the form, along with associated checkmarks and boxes, are then identified, followed by the identification of checked item, criteria scope, and previously undetected checklist sections. Once all checklist sections and checked criteria have been identified, remaining text spans within a checklist section are annotated as unchecked criteria.
    Type: Grant
    Filed: December 20, 2013
    Date of Patent: August 30, 2016
    Assignee: International Business Machines Corporation
    Inventors: Scott R. Carrier, Elena Romanova, Marie L. Setnes
  • Patent number: 9418122
    Abstract: A method and apparatus for dynamically adjusting the user interface of a search engine in order to effectively communicate the improved relevancy achieved through real-time implicit re-ranking of search results is described. Real-time implicit re-ranking occurs without delay after every user action as the search is being conducted, so finding methods of immediately altering the search page without disrupting the user experience is important. Graphical icons next to search results are employed to enable generating and removing re-ranked results, referred to as “recommended” search results. Clusters based on the real-time user model are also displayed to facilitate query reformulations. Sponsored links are selected using the real-time user model along with a combination of RPC and CTR information and are displayed in a manner similar to the organic results, or used to replace the initial sponsored links altogether.
    Type: Grant
    Filed: November 24, 2014
    Date of Patent: August 16, 2016
    Assignee: Surf Canyon Incorporated
    Inventor: Mark D. Cramer
  • Patent number: 8977628
    Abstract: Systems, methods, and computer-readable code stored on a non-transitory media for assessing an entity's innovation level by one or more computing devices include gathering information relating to an entity's performance in plural disciplines; capturing strengths and opportunities of the entity based on the gathered information; generating an innovation score of the entity; analyzing the innovation score to generate an innovation report; and returning the innovation report to the entity.
    Type: Grant
    Filed: January 3, 2012
    Date of Patent: March 10, 2015
    Assignee: Infosys Limited
    Inventor: Rajaram Venkataraman
  • Publication number: 20140101171
    Abstract: Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources.
    Type: Application
    Filed: November 8, 2012
    Publication date: April 10, 2014
    Applicant: ABBYY INFOPOISK LLC
    Inventors: Tatiana Danielyan, Konstantin Zuev
  • Patent number: 8671103
    Abstract: A method for semantic service registration and query based on WordNet is disclosed. The method includes the following steps: (1) semantic service registration: a service provider registers a service and uploads the Web Service Description Language (WSDL) document corresponding to the service, and a system parses the WSDL document to form a service description tree, then constructs a WordNet ontology tree according to the input of the service, performs a semantic annotation on the input/output of the service to form a Web Service Semantic Description Document (WSDL-S), and finally stores it in a register library; (2) semantic service discovery: a service requester inputs the information of service type, semantic information of the service input/output and other user-defined information to the register library to retrieve the services meeting the requirements; and (3) similarity sorting: the services meeting a certain threshold are sorted in descending order.
    Type: Grant
    Filed: April 7, 2010
    Date of Patent: March 11, 2014
    Assignee: Zhejiang University
    Inventors: Zhaohui Wu, Wenqiu Zeng, Jian Wu, Ying Li, Shuiguang Deng, Jianwei Yin
  • Publication number: 20140067832
    Abstract: Disclosed are methods for returning to a user an answer to the question “what is <string>.” Concepts and classes to which the concepts belong are determined from a corpus, such as taxonomy. The concepts are mapped to categories according to the structure of the taxonomy. Homonyms for words are collected and scored according to likeliness of use. Concept vectors are assembled for the identified concepts based on articles in the corpus and social media usage. Words are evaluated for generic-ness and a generic score is associated therewith. In responding to a query, the generic-ness of the terms of the query is evaluated and additional context solicited if the terms are generic. Candidate homonym concepts for a string in the query are selected according to context vectors for the homonym concepts. One or more homonym concepts are selected and the one or more categories corresponding to these concepts are returned.
    Type: Application
    Filed: September 28, 2012
    Publication date: March 6, 2014
    Applicant: Wal-Mart Stores, Inc.
    Inventors: Digvijay Singh Lamba, Xiaoyong Chai
  • Publication number: 20140040287
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for automatically parsing data from disparate data sources. In some implementations, actions include receiving first data from a first data source, identifying a first regular expression that corresponds to a data format of the first data, selecting a first set of parsing rules from a plurality of parsing rules based on the first regular expression, parsing the first data based on the first set of parsing rules to provide a first set of sub-data, populating data fields of a first data object with respective sub-data from the first set of sub-data, and transmitting the first data object to a computing device.
    Type: Application
    Filed: July 31, 2012
    Publication date: February 6, 2014
    Applicant: ACCENTURE GLOBAL SERVICES LIMITED
    Inventor: Eric Allan Frome
  • Publication number: 20140032582
    Abstract: According to an example, a method for matching regular expressions including word boundary symbols includes receiving an input string and receiving a regular expression including a word boundary symbol. The method further includes transforming, by a processor, the regular expression into an automaton such that a set of strings accepted by the automaton is the same as a set of strings described by the regular expression. The method also includes processing the input string by the automaton to determine if the input string matches the regular expression.
    Type: Application
    Filed: July 24, 2012
    Publication date: January 30, 2014
    Inventors: William G. Horne, Miranda Jane Felicity Mowbary
  • Publication number: 20140006369
    Abstract: In an example implementation, correlative patterns in structured data and in unstructured data are determined, where the determining includes finding a first pattern in the structured data and a second pattern in the unstructured data, and determining a degree of similarity between the first and second patterns. The structured data and unstructured data are processed according to the determined correlative patterns.
    Type: Application
    Filed: June 28, 2012
    Publication date: January 2, 2014
    Inventors: SEAN BLANCHFLOWER, DARREN JOHN GALLAGHER
  • Publication number: 20130325884
    Abstract: Techniques for comparing character strings include identifying a first character string having a first string length, and a second character string having a second string length greater than the first string length; parsing the first character string into one or more first sub-groups of characters; parsing the second character string into one or more second sub-groups of characters; comparing each of the one or more first sub-groups of characters against the one or more second sub-groups of characters; determining a ratio of a number of characters in the one or more first sub-groups of characters that match the one or more second sub-groups of characters and the second string length; and based on the ratio being greater or equal to a threshold, preparing at least one of the first or second character strings for display, the threshold including a variable value based on the first string length.
    Type: Application
    Filed: May 29, 2012
    Publication date: December 5, 2013
    Applicant: SAP PORTALS ISRAEL LTD.
    Inventors: Shachar Soel, Dmitry Gorenchteine, Udi Cohen
  • Publication number: 20130297597
    Abstract: A data storage system may contain a changeable database that includes: advisory information that includes the content of multiple advisory statements; query information that includes the content of multiple user queries; and flow logic information indicating a sequence for the delivery of the advisory statements and the user queries based on answers to the user queries. A user interface may deliver the user queries and the advisory statements to a user and receive answers in response to the user queries from the user. An information delivery engine that is separate from the changeable database may cause the user interface to deliver the user queries and the advisory statements to the user in a sequence based on the user's answers to the user queries and the flow logic information. This in system may be adopted to troubleshooting equipment, such as an optical network terminal.
    Type: Application
    Filed: May 7, 2012
    Publication date: November 7, 2013
    Applicant: Verizon Patent and Licensing Inc.
    Inventor: Ramesh V. SHAASTRI
  • Publication number: 20130297619
    Abstract: A system is presented that profiles authors and social media data across different media platforms and is capable of determining the author's overall social impact. In one aspect, this is accomplished by using a data retrieval service to trawl various web-sites and social media platforms for information about authors which can then be associated with those authors in a profile database. In one example, an author may post an entry on his/her blog and the data retrieval service can access the profile information of the author, on the blog, where various aspects of the profile information (e.g., real name, employee information, home address) can be matched with candidates in a profile database. From the information gathered, authors can be linked across multiple, different platforms, and an overall social impact of each of the authors can be determined.
    Type: Application
    Filed: May 7, 2012
    Publication date: November 7, 2013
    Applicant: The NASDAQ OMX Group, Inc.
    Inventors: Deepak CHANDRASEKARAN, David COSTELLO, Paul Stubbs
  • Publication number: 20130282736
    Abstract: The systems and methods described herein relate to making activity based connections. In an example embodiment a web based system can connect people together based on one or more items on an activity list such as a “bucket list.” For example, by collecting an activity list from a first user including at least one activity, collecting an activity list from a second user including at least one activity, analyzing the activity list of the first and second users to determine if the activities are the same or similar, and matching the first user to the second user based on the activity list of each user.
    Type: Application
    Filed: April 23, 2012
    Publication date: October 24, 2013
    Inventors: Kate Baldoni, Jillian Garton