Of Unstructured Textual Data (epo) Patents (Class 707/E17.058)
E Subclasses
-
Patent number: 11977569Abstract: Disclosed is a natural language processing pipeline that analyzes and processes a corpus of textual data to automatically create a knowledge graph containing the corpus entities such as subjects and object and their relationships such as predicates or verbs. The pipeline is configured as an end-to-end neural Open Schema Construction pipeline having a coreference resolution module, an open information extraction (OIE) module, and an entity canonicalization module. The processed textual data is input to a graph database to create the knowledge graph displayable through a graphical user interface. In operation, the pipeline modules serve to create a single term for all entity mentions in the corpus that reference the same entity through coreference resolution, extract all subject-predicate-object triplets from the coreference resolved corpus through OIE, and then canonicalize the corpus by clustering each entity mention to a canonical form for mapping to the knowledge graph and display.Type: GrantFiled: January 28, 2022Date of Patent: May 7, 2024Assignee: The United States of America, Represented by the Secretary of the NavyInventors: Michael Lynn Potter, Natalie Lynn Larson, Amy Cheng, Hovanes Keseyan, Hanh Servin
-
Patent number: 11914629Abstract: The present invention relates to information retrieval. In order to facilitate a search and identification of documents, there is provided a computer-implemented method for training a classifier model for data classification in response to a search query.Type: GrantFiled: August 5, 2021Date of Patent: February 27, 2024Assignee: BASF SEInventors: Arunav Mishra, Henning Schwabe, Lalita Shaki Uribe Ordonez
-
Patent number: 11914961Abstract: Systems, devices, and methods of the present invention involve discourse trees. In an example, a method involves generating a discourse tree. The method includes identifying, from the discourse tree, a central entity that is associated with a rhetorical relation of type elaboration and corresponds to a topic node that identifies a central entity of the text. The method includes determining a subset of elementary discourse units of the discourse tree that are associated with the central entity. The method includes forming generalized phrases from the subset of elementary discourse units. The method includes forming tuples from the generalized phrases, where a tuple is an ordered set of words in normal form. The method involves responsive to successfully converting an elementary discourse unit associated with an identified tuple into a logical representation, updating the ontology with an entity from the identified tuple.Type: GrantFiled: September 3, 2021Date of Patent: February 27, 2024Assignee: Oracle International CorporationInventor: Boris Galitsky
-
Patent number: 11900181Abstract: A data object from a data source is received by a distributed process in a data stream. The distributed process has a sequence of categories, each category containing one or more tasks that operate on the data object. The data object includes files that can be processed by the tasks. If the task is able to operate on the data object, then the data object is passed to the task. If the task is unable to operate on the data object, then the files in the data object are passed to a file staging area of the distributed process and stored in memory. The files in the file staging area are passed, in sequence, from the file staging area to the task that was unable to operate on the data object. The data object is outputted to a next category or data sink after being operated on by the task.Type: GrantFiled: April 28, 2021Date of Patent: February 13, 2024Assignee: FAIR ISAAC CORPORATIONInventors: Shalini Raghavan, Tom J. Traughber, George Vanecek, Jr.
-
Patent number: 11900177Abstract: In an example embodiment, a graphical user interface-based software tool is provided that uses integrated process information and information of a technical infrastructure to provide automatically-analyze integrations. The tool displays a list of integrations available with one or more corresponding software products linked to the first process by the linkage, each integration in the list of integrations identifying a separate software product with a separate application program interface (API) from the first software product.Type: GrantFiled: December 6, 2021Date of Patent: February 13, 2024Assignee: SAP SEInventor: Daniel Oberle
-
Patent number: 11886478Abstract: One or more computing devices, systems, and/or methods are provided. In an example, a first performance metric score may be determined based upon first content item text. A plurality of similarity scores associated with a plurality of sets of content item text may be determined. One or more sets of content item text may be selected from among the plurality of sets of content item text based upon the plurality of similarity scores and a plurality of performance metric scores associated with the plurality of sets of content item text. The plurality of performance metric scores may comprise one or more performance metric scores associated with the one or more sets of content item text. The one or more performance metric scores may be higher than the first performance metric score. One or more representations of the one or more sets of content item text may be displayed.Type: GrantFiled: May 7, 2021Date of Patent: January 30, 2024Assignee: Yahoo Assets LLCInventors: Shaunak Mishra, Changwei Hu, Kevin Yen, Manisha Verma, Yifan Hu, Maxim Ivanovich Sviridenko, Avinash Chukka, Max Edward Beech, Chao-Hung Wang, Hua-Ying Tsai, Kamil Michal Zasadzinski, Wei Yu Lin, Yu Tian
-
Patent number: 11869131Abstract: A system and method for determining an absolute position of an object in an area is presented. The system includes a server having a processor, and a plurality of camera nodes coupled to the server. Each node includes a camera that acquires images of the object and area. The server receives image data from a camera, detects the object within an approximate location by image analysis techniques, and determines a relative position of the object in pixel coordinates. The processor then detects stationary markers proximate to the relative location of the object, determines an absolute position of the detected markers relative to known markers to define an absolute position of the marker, and determines an absolute location of the object in relation to the absolute location of the detected marker. This absolute position of the object is provided to an official to accurately locate the object in the area.Type: GrantFiled: August 30, 2022Date of Patent: January 9, 2024Assignee: PRECISION POINT SYSTEMS, LLCInventors: Daniel Kohler, Terence Sauer, David Schroeder, Dennis Wanzie
-
Patent number: 11847499Abstract: Systems and methods for coordinating components can include: determining, by a first application executing on a client device, a need to perform a sharable functional task; identifying a first software component installed on the client device and capable of performing a first variation of the sharable functional task; identifying a second software component installed on the client device and capable of performing a second variation of the sharable functional task, wherein the second variation of the sharable functional task is functionally overlapping with and not identical to the first variation; identifying a set of characteristics of both the first software component and the second software component; selecting the second software component for performing the sharable functional task based on the set of characteristics, where the set of characteristics includes at least a version number; and delegating performance of the sharable functional task to the second software component.Type: GrantFiled: December 15, 2021Date of Patent: December 19, 2023Assignee: LOOKOUT INC.Inventors: Matthew John Joseph LaMantia, Brian James Buck, Stephen J. Edwards, William Neil Robinson
-
Patent number: 11797275Abstract: System and methods of ontology model development are disclosed. An example system and method may comprise, defining, an ontological model, generating, an ontology library based on the ontological model, deploying, the ontology library to an IoT system, generating, an ontology instance based on the ontology library deployed to the IoT system, modifying, an IoT application based on the ontology instance, and managing the IoT system utilizing the IoT application.Type: GrantFiled: July 9, 2018Date of Patent: October 24, 2023Assignee: SCHNEIDER ELECTRIC USA, INC.Inventors: Charbel Joseph El Kaed, André Ponnouradjane
-
Patent number: 11789949Abstract: A partition key format for allocating partitions to data items in a single table database, where the data items are owned by different entities. The partition key format including a sequence of a plurality of frames, wherein a first of said frames is an identifier of the requesting entity (EID), and a second one of said frames is an identifier of the type of data item (TID).Type: GrantFiled: June 23, 2021Date of Patent: October 17, 2023Assignee: Command Alkon IncorporatedInventors: Douglas A. Moore, Todd McPartlin
-
Patent number: 11775596Abstract: Some embodiments provide a method for defining a content relevance model for determining whether a content segment is relevant to a particular category. The method receives a first set of content segments that contain content relevant to the particular category and a second set of content segments that contain content not relevant to the particular category. The method identifies a set of key word sets more likely to appear in the first set of content segments than the second set of content segments. The method defines a content relevance model that comprises a set of groups of word sets and a score for each group, each of the groups of word sets comprising a key word set from the set of key word sets and at least one word set found in a context of the key word set in at least one of the received content segments.Type: GrantFiled: April 21, 2022Date of Patent: October 3, 2023Assignee: Aurea Software, Inc.Inventors: Ashutosh Joshi, Martin Betz, Rajiv Arora, Rakesh Kumar Srivastava, David Cooke
-
Patent number: 11776291Abstract: Systems and methods for generation and use of document analysis architectures are disclosed. A model builder component may be utilized to receiving user input data for labeling a set of documents as in class or out of class. That user input data may be utilized to train one or more classification models, which may then be utilized to predict classification of other documents. Trained models may be incorporated into a model taxonomy for searching and use by other users for document analysis purposes.Type: GrantFiled: June 10, 2020Date of Patent: October 3, 2023Assignee: AON RISK SERVICES, INC. OF MARYLANDInventors: Samuel Cameron Fleming, David Craig Andrews, John E. Bradley, III, Lewis C. Lee, Jared Dirk Sol, Timothy Seegan, Scott Buzan
-
Patent number: 11762893Abstract: Creating a summary of a plurality of texts includes tokenizing each of a plurality of texts to obtain tokens; generating a vector space using a first set of vectors having one or more obtained feature scores equal to or larger than a predefined value; executing non-hierarchical clustering using the vector space to generate a first plurality of clusters; choosing a first representative text in each of the plurality of clusters; generating a second set of vectors from each of the arrays generated based on a number of characters included in tokens of the representative texts; executing hierarchical clustering using the second set of vectors to generate a second plurality of clusters; and in response to a determining a number of clusters included in the second plurality of clusters, determining a second representative text for each of the clusters included in the second plurality of clusters.Type: GrantFiled: December 14, 2021Date of Patent: September 19, 2023Assignee: International Business Machines CorporationInventors: Yu Gu, Takayuki Kushida, Hiroki Nakano, Yaoping Ruan, Yuji Sugiyama
-
Patent number: 11755843Abstract: Systems and techniques that facilitate spurious relationship filtration from external knowledge graphs based on distributional semantics of an input corpus are provided. In one or more embodiments, a context component can generate a context-based word embedding of one or more first terms in a document collection. The embedding can yield vector representations of the one or more first terms. The one or more first terms can correspond to knowledge terms in one or more first nodes of a knowledge graph. In one or more embodiments, a filtering component can filter out a relationship between the one or more first nodes and a second node of the knowledge graph based on a similarity value being less than a threshold. The similarity value can be a function of the vector representations of the one or more first terms. In various embodiments, cosine similarity can be used to compute the similarity value.Type: GrantFiled: May 18, 2021Date of Patent: September 12, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Nandana Mihindukulasooriya, Robert G. Farrell, Nicolas Rodolfo Fauceglia, Alfio Massimiliano Gliozzo
-
Patent number: 11722507Abstract: The disclosed embodiments relate to a system that generates an alert based on information extracted from search results generated by a query. During operation, the system executes the query to generate the search results. The system also obtains configuration information for the alert, wherein the configuration information identifies information associated with the search results, and also specifies a trigger condition for the alert. Next, when the trigger condition for the alert is met, the system uses the configuration information to generate a payload containing the identified information associated with the search results. The system then invokes alert-generating functionality and provides the payload as input to the alert-generating functionality. This enables the alert-generating functionality to use the information from the search results while performing one or more alert actions association with the alert.Type: GrantFiled: March 18, 2022Date of Patent: August 8, 2023Assignee: Splunk Inc.Inventors: Nicholas J. Filippi, Siegfried Puchbauer-Schnabel, Carl S. Yestrau, Vivian Shen, J. Mathew Elting
-
Patent number: 11710496Abstract: A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.Type: GrantFiled: July 1, 2019Date of Patent: July 25, 2023Assignee: Google LLCInventors: Aaron Donsbach, Dirk Padfield
-
Patent number: 11699447Abstract: Systems and methods are provided herein for determining one or more traits of a speaker based on voice analysis to present content item to the speaker. In one example, the method receives a voice query and determines whether the voice query matches within a first confidence threshold of a speaker identification (ID) among a plurality of speaker IDs stored in a speaker profile. In response to determining that the voice query matches to the speaker ID within the first confidence threshold, the method bypasses a trait prediction engine and retrieves a trait among the plurality of traits in the speaker profile associated with the matched speaker ID. The method further provides a content item based on the retrieved trait.Type: GrantFiled: June 22, 2020Date of Patent: July 11, 2023Assignee: ROVI GUIDES, INC.Inventors: Ankur Anil Aher, Jeffry Copps Robert Jose
-
Patent number: 11651152Abstract: A method may include obtaining a document and using a first prediction model to generate text block scores for text blocks in the document, where a first text block of the text blocks is associated with a first text block score of the plurality of text block scores. The method also includes updating, in response to the first text block score for the first text block failing to satisfy a criterion, a modified version of the document with an indicator to set the first text block as a hidden text block in a presentation of the modified version. The method also includes generating a summarization of the first text block based on the words in the first text block and updating the modified version of the document to include the summarization. The method also includes providing the modified version of the document to a user device.Type: GrantFiled: June 10, 2021Date of Patent: May 16, 2023Assignee: Capital One Services, LLCInventors: Austin Walters, Anh Truong, Jeremy Goodsitt, Vincent Pham, Galen Rafferty, Reza Farivar
-
Patent number: 11636530Abstract: Aspects of the disclosure relate to content prediction. A computing platform may train a collaborative recommendation engine to output recommendation information based on historical preference information and corresponding data drift. The computing platform may receive an account access request from a user device. The computing platform may identify, at a first time and using the collaborative recommendation engine, a preference group for the user device. The computing platform may receive, at a second time later than the first time, a second account access request from the user device. The computing platform may identify, using the collaborative recommendation engine, the preference group and data drift corresponding to the preference group between the first time and the second time, which may indicate a second set of preferences at the second time. The computing platform may generate, based on the second set of preferences, recommendation information for the user device.Type: GrantFiled: June 28, 2021Date of Patent: April 25, 2023Assignee: Bank of America CorporationInventor: Maharaj Mukherjee
-
Patent number: 11550810Abstract: Methods and systems for providing computer-assisted guided review of unstructured data to generate a structured data output based on customizable template rules. In embodiments, an unstructured file is received, and a predefined template is selected. The predefined template includes a plurality of fields, each field corresponding to a field of the structured report. The predefined template also defines extraction rules for each field of the predefined template, and the extraction rules define parameters for identifying unstructured data relevant to the associated field. The extraction rules are applied to the unstructured file to identify data relevant to the field associated with the corresponding extraction rule, and the data identified as relevant is confirmed. Confirming the relevant data includes determining to refine the relevant data based on a condition, and modifying the extraction rule associated with the field to refine the relevant data.Type: GrantFiled: February 6, 2019Date of Patent: January 10, 2023Assignee: Thomson Reuters Enterprise Centre GmbHInventors: Hella-Franziska Hoffmann, Johannes Schleith
-
Patent number: 11544467Abstract: The present disclosure relates to processing operations configured to provide a linguistic-based approach to evaluating repetition in content of an electronic document. The approach of the present disclosure is about detecting terms/words/phrases that are likely to be perceived as being repetitious by native speakers of a language rather than just identifying the occurrence of identical words or strings in a document as done by traditional language checks. Processing of the present disclosure detects and evaluates terms or phrases using positive linguistic evidence derived from evaluation of linguistic relationships between words in a string in syntactic ways. This results in more accurate and efficient determination as to whether a term is truly repetitious at the linguistic level as compared with traditional language checks.Type: GrantFiled: June 15, 2020Date of Patent: January 3, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Davide Turcato, Alfredo R. Arnaiz, Domenic Joseph Cipollone, Michael Wilson Daniels
-
Patent number: 11522911Abstract: Methods, systems, and computer program products for performing passive and active identity verification in association with online communications. For example, a computer-implemented method may include receiving one or more electronic messages associated with a user account, analyzing the electronic messages based on a plurality of identity verification profiles associated with the user account, generating an identity trust score associated with the electronic messages based on the analyzing, determining whether to issue a security challenge in response to the electronic messages based on the generated identity trust score, and issuing the security challenge in response to the electronic messages based on the determining.Type: GrantFiled: April 19, 2021Date of Patent: December 6, 2022Assignee: PAYPAL, INC.Inventors: Bradley Wardman, Jakub Ceiran Burgis, Nicole Harris, Blake Butler, Nathan Robert Pratt, Kevin James Tyers
-
Patent number: 11521091Abstract: A computer-implemented method, a computer program product, and a computer system for enhanced distributed machine learning. A fusion server in a distributed machine learning system determines correlation relationships across agents in the distributed machine learning system, based on auxiliary information. The fusion server clusters the agents to form one or more communities, based on the correlation relationships. The fusion server selects, from the one or more communities, participating agents that participate in the enhanced distributed machine learning.Type: GrantFiled: May 22, 2019Date of Patent: December 6, 2022Assignee: International Business Machines CorporationInventors: Changchang Liu, Shiqiang Wang, Wei-Han Lee, Seraphin Bernard Calo
-
Patent number: 11481389Abstract: Methods, systems, and computer program products for generating an executable code based on a document are disclosed. Rules are identified in a document, the identified rules are translated into encoded rules, and an executable code is generated from the encoded rules. Identification of rules includes splitting a text of the document into a plurality of sentences; and for each sentence of the plurality of sentences, determining whether the sentence corresponds to a rule. Translation of an identified rule into an encoded rule includes extracting, from the identified rule, elements corresponding to predefined categories; determining one or more relationships between the extracted elements; and translating the one or more determined relationships into a structured expression. Generating the executable code from the encoded rules includes translating the structured expression associated with the identified rule into a programming language query.Type: GrantFiled: June 20, 2018Date of Patent: October 25, 2022Inventors: Youness Mansar, Sira Ferradans
-
Patent number: 11461343Abstract: Embodiments of the systems and methods disclosed herein provide a prescriptive analytics platform and polarity analysis engine in which a user can identify a target objective and use the system to find out whether the user's objectives are being met, what predictive factors are positively or negatively affecting the targeted objectives, as well as what recommended changes the user can make to better meet the objectives. The systems and methods may include a polarity analysis engine configured to determine the polarity of terms in free-text input in view of the target objective and the predictive factors and use the polarity to generate the recommended changes.Type: GrantFiled: February 4, 2021Date of Patent: October 4, 2022Assignee: Clearsense Acquisition 1, LLCInventors: Adrian Marc Bir, Nikolai Nikolaevich Liachenko, Daniel Brooks Presley
-
Patent number: 11449496Abstract: An example embodiment may involve a software application executable on computing devices of a remote network management platform and a computation instance associated with a managed network. The computational instance may contain a database storing data of the managed network. The software application may receive, from a client device of the managed network, a natural language query (NLQ), and retrieve Backus-Naur form (BNF) rules and a set of metadata associated with the BNF rules. The metadata may include a text-based description of a schema of the database and abbreviations associated with the BNF rules. The NLQ may be parsed using the BNF rules together with the metadata by applying the metadata during parsing to extend the BNF rules. A query object based on the parsed query may be generated, and the database searched using the query object. A result of the search may be transmitted to the client device.Type: GrantFiled: October 25, 2019Date of Patent: September 20, 2022Assignee: ServiceNow, Inc.Inventors: Mikhail Rumiantsau, Aliaksei Vertsel
-
Patent number: 11392675Abstract: Methods, systems, and computer-readable media for request authorization using service coordination are disclosed. An authorization data structure and an operation data structure are selected based at least in part on a request for an operation. The authorization data structure comprises a directed acyclic graph representing a flow of data between service operations associated with authorization of the operation, and the operation data structure comprises a directed acyclic graph representing a flow of data between a service operations associated with execution of the operation. Authorization of the operation is attempted using the authorization data structure, comprising invoking one or more of the service operations associated with authorization. If the authorization is successful, then the execution of the operation is initiated using the operation data structure, comprising invoking one or more of the service operations associated with execution.Type: GrantFiled: April 24, 2020Date of Patent: July 19, 2022Assignee: Amazon Technologies, Inc.Inventors: Robin Alan Golden, Marc Andrew Bowes, Izak Van Der Merwe
-
Patent number: 11341146Abstract: Data may be queried and analyzed in order to draw insights. One type of data query that may be performed is a funnel query. A funnel query is a query characterized by a sequence of events, e.g.: “In the last N days, how many unique users performed event A, then event B, and then event C”. Systems and methods for performing funnel queries are provided herein. In some embodiments, the speed at which a computer can answer a funnel query may be increased. In some embodiments, a bitmap is used to eliminate one or more sequences of events that would otherwise need to be traversed during the funnel query. In some embodiments, a sequence of events is stored across multiple data partitions, each data partition covering a different period of time.Type: GrantFiled: June 21, 2019Date of Patent: May 24, 2022Assignee: SHOPIFY INC.Inventors: Mikhal Arkhangorodsky, Mohammad Zeeshan Qureshi
-
Patent number: 11308096Abstract: Systems and methods for debiasing a recommendation engine are disclosed herein. A search query associated with a user profile is received at a recommendation engine. Control circuitry generates a result set of items of content based on the search query and generates a bias score for a content attribute based on the result set. The control circuitry also generates a time-averaged bias score for the content attribute based on a plurality of search queries associated with the user profile. Based on the bias score and the time-averaged bias score, the control circuitry determines whether a bias is signaled for the content attribute. Finally, the control circuitry outputs, for display via a computing device, the result set or a debiased result set based on a result of the determination of whether the bias is signaled.Type: GrantFiled: March 29, 2019Date of Patent: April 19, 2022Assignee: Rovi Guides, Inc.Inventors: Rajendran Pichaimurthy, Madhusudhan Srinivasan
-
Patent number: 10592565Abstract: An objective of the present invention is providing a method and apparatus for providing recommended information. A method according to the present invention comprises steps of: determining, based on one or more pieces of content information in one or more webpages, whether the one or more pieces of content information may be used as recommended information, respectively; obtaining feature information of the recommended information if the content information is recommended information; determining ordering information of the each piece of recommended information based on the feature information of each piece of recommended information; wherein the method further comprises the following step: if a user's browsing operation on the webpage corresponds to at least one piece of recommended information, presenting the at least one piece of recommended information.Type: GrantFiled: December 31, 2014Date of Patent: March 17, 2020Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.Inventors: Chi Tang, Huadong Li, Weiyu Chen, Jiajia Chen, Xunliang Cai, Yang Song
-
Patent number: 10579663Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.Type: GrantFiled: May 2, 2017Date of Patent: March 3, 2020Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu
-
Patent number: 10394875Abstract: A document relationship analysis system. Aspects of the system include ingesting, discovering, recommending, analyzing, and exporting documents of interest. The system dynamically searches large or streaming datasets using a tiered, multi-step approach that includes discovery techniques and recommender components to filter and refine these larger datasets to smaller datasets of documents of interest. The system dynamically selects and renders an appropriate visualization for result datasets based on predetermined measures that allow for facilitate analysis of the documents of interest.Type: GrantFiled: February 16, 2018Date of Patent: August 27, 2019Assignee: VortexT Analytics, Inc.Inventors: Matthew Cody Lambert, Peter Joseph Angerani, Gregory David Ostermayr
-
Patent number: 10255283Abstract: A mechanism for progressive topic modeling is disclosed to facilitate document content analysis. Input documents can be sorted and divided into multiple groups. Topic modeling is performed for each group, where the topic modeling for one group is based on the generated topic model from a previous group, if available. The vocabulary used in the topic modeling process can also be updated for each group of documents. The generated topics can be presented in a user interface to facilitate a user in analyzing the documents. The topic modeling mechanism can also be utilized to enhance a document search experience by generating topics from documents contained in search results and presenting topic words to a user as suggested search terms.Type: GrantFiled: September 19, 2016Date of Patent: April 9, 2019Assignee: Amazon Technologies, Inc.Inventors: Weiwei Cheng, Christopher Gonzales
-
Patent number: 9928295Abstract: A document relationship analysis system. Aspects of the system include ingesting, discovering, recommending, analyzing, and exporting documents of interest. The system dynamically searches large or streaming datasets using a tiered, multi-step approach that includes discovery techniques and recommender components to filter and refine these larger datasets to smaller datasets of documents of interest. The system dynamically selects and renders an appropriate visualization for result datasets based on predetermined measures that allow for facilitate analysis of the documents of interest.Type: GrantFiled: February 2, 2015Date of Patent: March 27, 2018Assignee: Vortext Analytics, Inc.Inventors: Matthew Cody Lambert, Peter Joseph Angerani, Gregory David Ostermayr
-
Patent number: 9773049Abstract: A method includes receiving a first request from a first user device for first data, where the first request identifies a first data source and sending a first data access request to the first data source, where the first data access request is based on a first reader object associated with the first data source. The method also includes receiving the first data from the first data source, where the first data has a first format, and transforming the first data to normalized data in a normalized format based on the first reader object. The method further includes selecting a first presentation object from a database comprising a plurality of presentation objects based on a first device type of the first user device and transforming the normalized data to output data in an output format based on the first presentation object.Type: GrantFiled: June 16, 2015Date of Patent: September 26, 2017Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.Inventors: Bryan Davis, Dan Musgrove, Michael Raftelis
-
Patent number: 9753921Abstract: A content management system including a document management system provides documents that include comments entered by users. Comments are organized into threads; each thread is associated with a span of text in the document. When a user requests access to a document, the document management system determines which threads are visible to the user based on an audience associated with each thread. the audience comprises the user identifiers of i) the author of the document containing the thread; ii) the authors of comments included in the thread; iii) the authors of any text included in the text span for the thread; iv) any user mentioned in the text span the thread via a user primitive; v) any user mentioned in a comment via user primitive.Type: GrantFiled: April 29, 2015Date of Patent: September 5, 2017Assignee: DROPBOX, INC.Inventors: Anthony DeVincenzi, Matthew Blackshaw, Balabhadra Graveley, Igor Kofman
-
Patent number: 9471644Abstract: A computer-implemented method, computer-readable medium and system for scoring a text are disclosed. Themes within one or more texts may be determined and used to score each text, where an overall score for each text may indicate a respective importance and/or value of each text. The score for each text may be determined based upon a number of themes, type of themes, frequency of theme elements associated with the themes, distribution of theme elements associated with the themes, location of themes in the text, some combination thereof, etc. In this manner, the importance or value of one or more texts may be determined more accurately using information within each text with reduced reliance upon external information. Additionally, more relevant search results can be returned to a user by using internal information to perform ranking operations and/or filtering operations associated with a search.Type: GrantFiled: December 29, 2014Date of Patent: October 18, 2016Assignee: LEXXE PTY LTDInventor: Hong Liang Qiao
-
Patent number: 9430464Abstract: A method, system and computer-usable medium are disclosed for identifying unchecked criteria in unstructured and semi-structured data within a form. Text spans representing unchecked criteria within unstructured text in a form are detected and classified to facilitate accurate interpretation of the text. Section identification and annotation operations are then performed to identify and categorize sections within the form. Checklist sections within the form, along with associated checkmarks and boxes, are then identified, followed by the identification of checked item, criteria scope, and previously undetected checklist sections. Once all checklist sections and checked criteria have been identified, remaining text spans within a checklist section are annotated as unchecked criteria.Type: GrantFiled: December 20, 2013Date of Patent: August 30, 2016Assignee: International Business Machines CorporationInventors: Scott R. Carrier, Elena Romanova, Marie L. Setnes
-
Patent number: 9418122Abstract: A method and apparatus for dynamically adjusting the user interface of a search engine in order to effectively communicate the improved relevancy achieved through real-time implicit re-ranking of search results is described. Real-time implicit re-ranking occurs without delay after every user action as the search is being conducted, so finding methods of immediately altering the search page without disrupting the user experience is important. Graphical icons next to search results are employed to enable generating and removing re-ranked results, referred to as “recommended” search results. Clusters based on the real-time user model are also displayed to facilitate query reformulations. Sponsored links are selected using the real-time user model along with a combination of RPC and CTR information and are displayed in a manner similar to the organic results, or used to replace the initial sponsored links altogether.Type: GrantFiled: November 24, 2014Date of Patent: August 16, 2016Assignee: Surf Canyon IncorporatedInventor: Mark D. Cramer
-
Patent number: 8977628Abstract: Systems, methods, and computer-readable code stored on a non-transitory media for assessing an entity's innovation level by one or more computing devices include gathering information relating to an entity's performance in plural disciplines; capturing strengths and opportunities of the entity based on the gathered information; generating an innovation score of the entity; analyzing the innovation score to generate an innovation report; and returning the innovation report to the entity.Type: GrantFiled: January 3, 2012Date of Patent: March 10, 2015Assignee: Infosys LimitedInventor: Rajaram Venkataraman
-
Publication number: 20140101171Abstract: Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources.Type: ApplicationFiled: November 8, 2012Publication date: April 10, 2014Applicant: ABBYY INFOPOISK LLCInventors: Tatiana Danielyan, Konstantin Zuev
-
Patent number: 8671103Abstract: A method for semantic service registration and query based on WordNet is disclosed. The method includes the following steps: (1) semantic service registration: a service provider registers a service and uploads the Web Service Description Language (WSDL) document corresponding to the service, and a system parses the WSDL document to form a service description tree, then constructs a WordNet ontology tree according to the input of the service, performs a semantic annotation on the input/output of the service to form a Web Service Semantic Description Document (WSDL-S), and finally stores it in a register library; (2) semantic service discovery: a service requester inputs the information of service type, semantic information of the service input/output and other user-defined information to the register library to retrieve the services meeting the requirements; and (3) similarity sorting: the services meeting a certain threshold are sorted in descending order.Type: GrantFiled: April 7, 2010Date of Patent: March 11, 2014Assignee: Zhejiang UniversityInventors: Zhaohui Wu, Wenqiu Zeng, Jian Wu, Ying Li, Shuiguang Deng, Jianwei Yin
-
Publication number: 20140067832Abstract: Disclosed are methods for returning to a user an answer to the question “what is <string>.” Concepts and classes to which the concepts belong are determined from a corpus, such as taxonomy. The concepts are mapped to categories according to the structure of the taxonomy. Homonyms for words are collected and scored according to likeliness of use. Concept vectors are assembled for the identified concepts based on articles in the corpus and social media usage. Words are evaluated for generic-ness and a generic score is associated therewith. In responding to a query, the generic-ness of the terms of the query is evaluated and additional context solicited if the terms are generic. Candidate homonym concepts for a string in the query are selected according to context vectors for the homonym concepts. One or more homonym concepts are selected and the one or more categories corresponding to these concepts are returned.Type: ApplicationFiled: September 28, 2012Publication date: March 6, 2014Applicant: Wal-Mart Stores, Inc.Inventors: Digvijay Singh Lamba, Xiaoyong Chai
-
Publication number: 20140040287Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for automatically parsing data from disparate data sources. In some implementations, actions include receiving first data from a first data source, identifying a first regular expression that corresponds to a data format of the first data, selecting a first set of parsing rules from a plurality of parsing rules based on the first regular expression, parsing the first data based on the first set of parsing rules to provide a first set of sub-data, populating data fields of a first data object with respective sub-data from the first set of sub-data, and transmitting the first data object to a computing device.Type: ApplicationFiled: July 31, 2012Publication date: February 6, 2014Applicant: ACCENTURE GLOBAL SERVICES LIMITEDInventor: Eric Allan Frome
-
Publication number: 20140032582Abstract: According to an example, a method for matching regular expressions including word boundary symbols includes receiving an input string and receiving a regular expression including a word boundary symbol. The method further includes transforming, by a processor, the regular expression into an automaton such that a set of strings accepted by the automaton is the same as a set of strings described by the regular expression. The method also includes processing the input string by the automaton to determine if the input string matches the regular expression.Type: ApplicationFiled: July 24, 2012Publication date: January 30, 2014Inventors: William G. Horne, Miranda Jane Felicity Mowbary
-
Publication number: 20140006369Abstract: In an example implementation, correlative patterns in structured data and in unstructured data are determined, where the determining includes finding a first pattern in the structured data and a second pattern in the unstructured data, and determining a degree of similarity between the first and second patterns. The structured data and unstructured data are processed according to the determined correlative patterns.Type: ApplicationFiled: June 28, 2012Publication date: January 2, 2014Inventors: SEAN BLANCHFLOWER, DARREN JOHN GALLAGHER
-
Publication number: 20130325884Abstract: Techniques for comparing character strings include identifying a first character string having a first string length, and a second character string having a second string length greater than the first string length; parsing the first character string into one or more first sub-groups of characters; parsing the second character string into one or more second sub-groups of characters; comparing each of the one or more first sub-groups of characters against the one or more second sub-groups of characters; determining a ratio of a number of characters in the one or more first sub-groups of characters that match the one or more second sub-groups of characters and the second string length; and based on the ratio being greater or equal to a threshold, preparing at least one of the first or second character strings for display, the threshold including a variable value based on the first string length.Type: ApplicationFiled: May 29, 2012Publication date: December 5, 2013Applicant: SAP PORTALS ISRAEL LTD.Inventors: Shachar Soel, Dmitry Gorenchteine, Udi Cohen
-
Publication number: 20130297597Abstract: A data storage system may contain a changeable database that includes: advisory information that includes the content of multiple advisory statements; query information that includes the content of multiple user queries; and flow logic information indicating a sequence for the delivery of the advisory statements and the user queries based on answers to the user queries. A user interface may deliver the user queries and the advisory statements to a user and receive answers in response to the user queries from the user. An information delivery engine that is separate from the changeable database may cause the user interface to deliver the user queries and the advisory statements to the user in a sequence based on the user's answers to the user queries and the flow logic information. This in system may be adopted to troubleshooting equipment, such as an optical network terminal.Type: ApplicationFiled: May 7, 2012Publication date: November 7, 2013Applicant: Verizon Patent and Licensing Inc.Inventor: Ramesh V. SHAASTRI
-
Publication number: 20130297619Abstract: A system is presented that profiles authors and social media data across different media platforms and is capable of determining the author's overall social impact. In one aspect, this is accomplished by using a data retrieval service to trawl various web-sites and social media platforms for information about authors which can then be associated with those authors in a profile database. In one example, an author may post an entry on his/her blog and the data retrieval service can access the profile information of the author, on the blog, where various aspects of the profile information (e.g., real name, employee information, home address) can be matched with candidates in a profile database. From the information gathered, authors can be linked across multiple, different platforms, and an overall social impact of each of the authors can be determined.Type: ApplicationFiled: May 7, 2012Publication date: November 7, 2013Applicant: The NASDAQ OMX Group, Inc.Inventors: Deepak CHANDRASEKARAN, David COSTELLO, Paul Stubbs
-
Publication number: 20130282736Abstract: The systems and methods described herein relate to making activity based connections. In an example embodiment a web based system can connect people together based on one or more items on an activity list such as a “bucket list.” For example, by collecting an activity list from a first user including at least one activity, collecting an activity list from a second user including at least one activity, analyzing the activity list of the first and second users to determine if the activities are the same or similar, and matching the first user to the second user based on the activity list of each user.Type: ApplicationFiled: April 23, 2012Publication date: October 24, 2013Inventors: Kate Baldoni, Jillian Garton