Based On Phrase, Clause, Or Idiom Patents (Class 704/4)
-
Patent number: 10635722Abstract: There are provided decentralized system and method of managing electronic documents of title (EDTs).Type: GrantFiled: April 20, 2016Date of Patent: April 28, 2020Assignee: OGY DOCS, INC.Inventors: Gad Ruschin, Or Garbash, Yair Sappir
-
Patent number: 10637826Abstract: An online system determines whether a test content item violates a policy of the online system. The online system extracts a semantic from the test content item and determines a distance between the extracted semantic vector and the stored semantic vectors for content items that have been labeled to indicate whether they violate a policy. Using a nearest neighbor search, the online system selects a set of the stored semantic vectors and assigns a weight to the selected semantic vectors that is inversely related to the distances. The online system then determines whether the test content item violates a policy using a weighed voting scheme, where the labels of the stored semantic vectors are aggregated based on their associated weights. The online system may first attempt to match the test content with known bad content and terminate the more complex nearest neighbor search if such a match is found.Type: GrantFiled: August 6, 2018Date of Patent: April 28, 2020Assignee: Facebook, Inc.Inventors: Enming Luo, Emanuel Alexandre Strauss
-
Patent number: 10615965Abstract: A messaging service provides a search mechanism that utilizes a protected index. The protected index is generated by converting documents maintained by the messaging service into a set of tokens or words. Each token is converted to a corresponding value using a transformation such as a cryptographic hash function. The values are placed into an index that allows the messaging service to efficiently identify a set of documents associated with each particular value. When a document search request is submitted to the messaging service, the messaging service uses the transformation to generate corresponding values for each term in the search request, and uses the index to identify sets of documents associated with the values corresponding to the search terms. The messaging service applies search logic associated with the search request to the identified sets of documents to produce a final set of documents satisfying search request.Type: GrantFiled: March 22, 2017Date of Patent: April 7, 2020Assignee: Amazon Technologies, Inc.Inventor: Matthew E. Goldberg
-
Patent number: 10614100Abstract: A method comprising using at least one hardware processor for: receiving a topic under consideration (TUC) and a set of claims referring to the TUC; identifying semantic similarity relations between claims of the set of claims; clustering the claims into a plurality of claim clusters based on the identified semantic similarity relations, wherein said claim clusters represent semantically different claims of the set of claims; and generating a list of non-redundant claims comprising said semantically different claims.Type: GrantFiled: April 29, 2015Date of Patent: April 7, 2020Assignee: International Business Machines CorporationInventors: Mitesh Khapra, Vikas Raykar, Amrita Saha, Noam Slonim, Ashish Verma
-
Patent number: 10599885Abstract: Systems, devices, and methods of the present invention uses noisy-robust discourse trees to determine a rhetorical relationship between one or more sentences. In an example, a rhetoric classification application creates a noisy-robust communicative discourse tree. The application accesses a document that includes a first sentence, a second sentence, a third sentence, and a fourth sentence. The application identifies that syntactic parse trees cannot be generated for the first sentence and the second sentence. The application further creates a first communicative discourse tree from the second, third, and fourth sentences and a second communicative discourse tree from the first, third, and fourth sentences. The application aligns the first communicative discourse tree and the second communicative discourse tree and removes any elementary discourse units not corresponding to a relationship that is in common between the first and second communicative discourse trees.Type: GrantFiled: June 15, 2018Date of Patent: March 24, 2020Assignee: Oracle International CorporationInventor: Boris Galitsky
-
Patent number: 10599731Abstract: Described is a technique for associating words used in a search query with categories. This technique aims to produce potentially more relevant search results by improving the associations with words used for a search. A machine learning technique is implemented to train a classification model, which may include a word embedding model. The classification model is trained to receive words as input and to create vectors of the words as output. These word vectors may then be mapped to a vector space and the technique may then perform a cluster analysis of the vectors. Based on the cluster analysis, clusters may be identified and each cluster may be associated with a corresponding category.Type: GrantFiled: April 26, 2016Date of Patent: March 24, 2020Assignee: BAIDU USA LLCInventors: Yu Zhu, Lin Li
-
Patent number: 10594638Abstract: An electronic chat session monitoring device intercepts a text message from an electronic chat session. The text message is generated by a sender and addressed to an addressee. The electronic chat session monitoring device receives a current photo of the sender of the text message electronic chat session, which is taken contemporaneously with a generation of the text message by the sender and depicts an emotion of the sender while generating the text message. The electronic chat session monitoring device then transmits both the text message and the current photo of the sender to the addressee.Type: GrantFiled: February 13, 2015Date of Patent: March 17, 2020Assignee: International Business Machines CorporationInventors: James E. Bostick, John M. Ganci, Jr., Sarbajit K. Rakshit, Kimberly G. Starks
-
Patent number: 10585985Abstract: Methods and systems for scoring written text based on use of idiomatic expressions, including reading pre-selected idiomatic expressions in a canonical form into memory, expanding idiomatic expressions from the canonical form, reading a written response into the memory, pre-processing the written response, searching the pre-processed written response for idiomatic expressions, and assigning a score to the written response. The score may be based at least in part on the number of idiomatic expressions in the written response. Corresponding apparatuses, systems, and methods are also disclosed.Type: GrantFiled: December 14, 2017Date of Patent: March 10, 2020Assignee: Educational Testing ServiceInventors: Michael Flor, Beata Beigman Klebanov
-
Patent number: 10574605Abstract: A mechanism is provided for validating the tone of an electronic communication being composed based on the recipients of the electronic communication. At least one tone of the electronic communication being composed by a sender and an identity of each of one or more recipients to whom the electronic communication is to be sent and the sender are identified. One or more previous electronic communications sent to or received from one or more of the one or more recipients and at least one tone of each of the one or more previous electronic communications are identified in order to generate one or more preferred tones. The tone of the electronic communication being composed is compared to the one or more preferred tones. Responsive to identifying a tone discrepancy between the electronic communication being composed and the one or more preferred tones, a notification is presented to the sender.Type: GrantFiled: May 18, 2016Date of Patent: February 25, 2020Assignee: International Business Machines CorporationInventors: Florian Pinel, Edward E. Seabolt
-
Patent number: 10574607Abstract: A mechanism is provided for validating an attachment to an electronic communication being composed based on the recipients of the electronic communication. An associated tone or theme of the at least one attachment to the electronic communication being composed by a sender and an identity of each of one or more recipients to whom the electronic communication is to be sent and the sender are identified. One or more previous electronic communications sent to or received from one or more of the one or more recipients and at least one tone of each of the one or more previous electronic communications are identified in order to generate one or more preferred tones. Responsive to identifying a tone discrepancy between the tone or theme of the at least one attachment and the one or more preferred tones, a notification is presented to the sender.Type: GrantFiled: May 18, 2016Date of Patent: February 25, 2020Assignee: International Business Machines CorporationInventors: Florian Pinel, Edward E. Seabolt
-
Patent number: 10554863Abstract: An image forming apparatus includes a display device, an input device configured to receive a user operation, a preview processing unit configured to display as preview on the display device a document image as an output target, and an output processing unit configured to output the document image. Further, the preview processing unit (a) detects an area surrounded by a predetermined specific color in the document image and displays the document image as preview on the display device so as to make the detected area editable, and (b) add a text specified by the user operation in the detected area in accordance with the user operation. The output processing unit outputs the document image in which the text was added.Type: GrantFiled: May 7, 2018Date of Patent: February 4, 2020Assignee: Kyocera Document Solutions, Inc.Inventor: Koji Tagaki
-
Patent number: 10552107Abstract: In one example of the disclosure, a set of electronic document templates is accessed and instances of duplicated document content are identified. Display of a user notice for first duplicated document content is caused. Responsive to receipt of data indicative of a user instruction to create a component template for the first duplicated content, the component template is created and stored.Type: GrantFiled: December 2, 2015Date of Patent: February 4, 2020Assignee: OPEN TEXT CORPORATIONInventors: James Matthew Downs, Billy R. Kidwell, Anthony Wiley
-
Patent number: 10482876Abstract: A speech interpretation module interprets the audio of user utterances as sequences of words. To do so, the speech interpretation module parameterizes a literal corpus of expressions by identifying portions of the expressions that correspond to known concepts, and generates a parameterized statistical model from the resulting parameterized corpus. When speech is received the speech interpretation module uses a hierarchical speech recognition decoder that uses both the parameterized statistical model and language sub-models that specify how to recognize a sequence of words. The separation of the language sub-models from the statistical model beneficially reduces the size of the literal corpus needed for training, reduces the size of the resulting model, provides more fine-grained interpretation of concepts, and improves computational efficiency by allowing run-time incorporation of the language sub-models.Type: GrantFiled: October 1, 2018Date of Patent: November 19, 2019Assignee: Interactions LLCInventors: Ethan Selfridge, Michael Johnston
-
Patent number: 10460034Abstract: An intention inference system includes, a morphological analyzer to perform morphological analysis for a complex sentence with multiple intentions involved, a syntactic analyzer to perform syntactic analysis for the complex sentence morphologically analyzed by the morphological analyzer and to divide it into the first simple sentence and the second simple sentence, an intention inference unit to infer the first intention involved in the first simple sentence and the second intention involved in the second simple sentence, a feature extractor to extract as the first feature a morpheme showing execution order of operations involved in the first simple sentence and to extract as the second feature a morpheme showing execution order of operations involved in the second simple sentence, and an execution order inference unit to infer the execution order of the first operation corresponding to the first intention and the second operation corresponding to the second intention on the basis of the first feature and theType: GrantFiled: January 28, 2015Date of Patent: October 29, 2019Assignee: Mitsubishi Electric CorporationInventors: Yi Jing, Yusuke Koji, Jun Ishii
-
Patent number: 10452698Abstract: An unstructured data analytics system, including: an unstructured data analytics algorithm resident on a server and accessible via a browser operable for receiving unstructured data from one or more remote sources, applying one or more analytical tools to the unstructured data, and displaying summary information to one or more users; wherein the summary information is displayed to the one or more users in a presentation layer, an exploration layer, and an annotation layer. The unstructured data analytics algorithm is also operable for receiving outside data from one or more remote sources. The presentation layer displays one or more of the unstructured data a summary of the unstructured data, and the summary information. The exploration layers allows the one or more users to modify the granularity of the summary information, thereby modifying the granularity of the presentation layer. The one or more users can interact with the unstructured data analytics system simultaneously via the annotation layer.Type: GrantFiled: May 11, 2016Date of Patent: October 22, 2019Assignee: Stratifyd, Inc.Inventor: Xiaoyu Wang
-
Patent number: 10446137Abstract: Systems, components, devices, and methods for resolving ambiguity in a conversational understanding system are provided. A non-limiting example is a system or method for resolving ambiguity in a conversational understanding system. The method includes the steps of receiving a natural language input and identifying an agent action based on the natural language input. The method also includes the steps of determining an ambiguity value associated with the agent action and evaluating the ambiguity value against an ambiguity condition. The method includes the steps of when determined that the ambiguity value meets the ambiguity condition: selecting a prompting action based on the ambiguity associated with the identified agent action, performing the prompting action, receiving additional input in response to the prompting action, and updating the agent action to resolve the ambiguity based on the additional input. The method also includes the step of performing the agent action.Type: GrantFiled: October 19, 2016Date of Patent: October 15, 2019Assignee: Microsoft Technology Licensing, LLCInventors: Omar Zia Khan, Ruhi Sarikaya, Divya Jetley
-
Patent number: 10444894Abstract: In an example implementation according to aspects of the present disclosure, a method may include capturing data entered on a touch sensitive mat or on an object physically disposed on the touch sensitive mat. The method further includes extracting the data from the captured image, and developing contextual information from the data extracted from the captured image. The method further includes projecting the contextual information onto the touch sensitive mat or onto the object.Type: GrantFiled: September 12, 2014Date of Patent: October 15, 2019Assignee: Hewlett-Packard Development Company, L.P.Inventors: Immanuel Amo, Diogo Lima, Nicholas P Lyons, Arman Alimian
-
Patent number: 10437872Abstract: A computer implemented and computer controlled method of arranging data for processing and storage thereof at a data storage engine. To identified data elements, an action is assigned from a plurality of actions as well as an association between data elements of an action according to a respective topology comprised of an ordered plurality of data categories including a subject data category, an object data category, a spatial data category and a temporal data category. By matching the identified data elements with action topology combinations and using the order of the data elements, one data element is matched with one data category. Instance information is supplemented to matched action topology combinations. In a computer readable format, at a data storage engine, identified data elements, instance information and associations between identifiers resulting from identifying, assigning, matching and supplementing are stored.Type: GrantFiled: May 26, 2017Date of Patent: October 8, 2019Assignee: DYNACTIONIZE N.V.Inventor: Michael Rik Frans Brands
-
Patent number: 10437932Abstract: A determination method executed by a computer including a memory and a processor coupled to the memory, includes receiving a plurality of sentences and designation of terms included in the plurality of sentences, generating, for each term for which the designation is received, information indicating a relation between the term and each of other terms included in one of the plurality of sentences containing the term, extracting, for each term for which the designation is received, information indicating a specific relation from the generated information indicating the relation, generating characteristic information that uses the extracted information as a feature, and determining similarity between a plurality of the terms based on the generated characteristic information for each term.Type: GrantFiled: March 19, 2018Date of Patent: October 8, 2019Assignee: FUJITSU LIMITEDInventors: Kazuo Mineno, Nobuko Takase, Naohiro Itou
-
Patent number: 10423665Abstract: The present teaching relates to generating a conversational agent. In one example, a plurality of input utterances may be received from a developer. A paraphrase model is obtained. The paraphrase model is generated based on machine translation. For each of the plurality of input utterances, one or more paraphrases of the input utterance are generated based on the paraphrase model. For each of the plurality of input utterances, at least one of the one or more paraphrases is selected based on an instruction from the developer to generate selected paraphrases. The conversational agent is generated based on the plurality of input utterances and the selected paraphrases.Type: GrantFiled: August 2, 2017Date of Patent: September 24, 2019Assignee: Oath Inc.Inventors: Ankur Gupta, Timothy Daly, Tularam Ban
-
Patent number: 10423615Abstract: The method includes monitoring a computing device for language settings during user-generated content creation and detect one or more language settings. The method further includes analyzing user-created content to detect a language from a text of the user-generated content. The method further includes compiling a list of scored preferred languages for the computing device based on the detected language settings and the detected language of the text. The method further includes intercepting a query from the computing device. The method further includes analyzing a text of the intercepted query in a plurality of selected languages based on a language setting of a user interface application, a detected language of the query, and a predetermined number of preferred languages of the computing device to produce results of analysis for each selected language. The method further includes generating a multilingual query based on the results of analysis for the selected languages.Type: GrantFiled: June 10, 2016Date of Patent: September 24, 2019Assignee: International Business Machines CorporationInventors: Leonid Bolshinsky, Vladimir Gamaley, Sharon Krisher
-
Patent number: 10417269Abstract: A system and method for verbatim-text mining including parsing documents of a text corpus into a plurality of individual sentences, assigning a sentence identifier to one or more individual sentences of the plurality of individual sentences, generating a plurality of n-Gram strings comprising a plurality of n-Grams from words within the individual sentence, applying an inverted index to the n-Gram string, combining an index data structure of one n-Gram string with an index data structure of another n-Gram string forming a merged index data structure when the index data structure of one n-Gram string shares a predetermined percentage of sentence identifiers of the index data structure of another n-Gram string, assigning a group identifier to the merged index data structure of a one or more merged index data structures, and creating a data set comprising the sentence identifier, the group identifier and the associated n-Gram string.Type: GrantFiled: March 13, 2017Date of Patent: September 17, 2019Assignee: LexisNexis, a division of Reed Elsevier Inc.Inventor: Paul Zhang
-
Patent number: 10409919Abstract: A display method includes reading from a memory a language setting representing an original language and a first target language; detecting a first set of one or more characters input in the original language; recognizing the first set of one or more characters as first text; translating the first text from the original language to the first target language; displaying the translated first text on one or more display areas; translating the translated first text back to the original language; and displaying the first text translated back to the original language on the one or more display areas.Type: GrantFiled: September 28, 2015Date of Patent: September 10, 2019Assignee: Konica Minolta Laboratory U.S.A., Inc.Inventors: Howard Rubin, Isao Hayami
-
Patent number: 10409810Abstract: The method includes monitoring a computing device for language settings during user-generated content creation and detect one or more language settings. The method further includes analyzing user-created content to detect a language from a text of the user-generated content. The method further includes compiling a list of scored preferred languages for the computing device based on the detected language settings and the detected language of the text. The method further includes intercepting a query from the computing device. The method further includes analyzing a text of the intercepted query in a plurality of selected languages based on a language setting of a user interface application, a detected language of the query, and a predetermined number of preferred languages of the computing device to produce results of analysis for each selected language. The method further includes generating a multilingual query based on the results of analysis for the selected languages.Type: GrantFiled: May 8, 2015Date of Patent: September 10, 2019Assignee: International Business Machines CorporationInventors: Leonid Bolshinsky, Vladimir Gamaley, Sharon Krisher
-
Patent number: 10394959Abstract: Machine training for determining sentiments in social network communications. A text document is extracted from a web site and tokenized into tokens. The tokens are input to a word to vector conversion model to generate word vectors. A term frequency inverse document frequency (TF-IDF) algorithm converts the word vectors to sentence vectors. A randomly selected subset the sentence vectors are tagged and used to train a classifier. The classifier takes a sentence vector and predicts a sentiment associated with the sentence vector. Predicted sentiment associated with each of the sentence vectors may be combined to generate a sentiment associated with the text document.Type: GrantFiled: December 21, 2017Date of Patent: August 27, 2019Assignee: International Business Machines CorporationInventors: Ankur Tagra, Rajat Verma, Sudarshan Narayanan
-
Patent number: 10387529Abstract: The present invention may be a method, a system, and/or a computer program product. An embodiment of the present invention provides a method for paraphrasing, on a client computer, text in a webpage, the method comprising the following: transferring a request for a webpage including a plurality of passages of text to a server; receiving the webpage from the server in response to the request; judging whether or not the received webpage has text which is a subject of paraphrase; in a case where the judgment is positive, paraphrasing the text; and displaying, on a display, the webpage including the paraphrased text. Another embodiment of the present invention provides a method for updating on a server, text in a webpage, the method comprising the following: receiving, from each of the devices, a set of URLs of a webpage, a location path of text which is a subject of paraphrase in the webpage, and paraphrased text; and replacing text in the webpage with text among the received text.Type: GrantFiled: February 16, 2017Date of Patent: August 20, 2019Assignee: International Businesss Machines CorporationInventors: Kentaroh Noji, Akihiko Takajo, Yukiko Yasuda
-
Patent number: 10380126Abstract: Systems and methods for content aggregation creation are disclosed herein. The system can include memory having a content database and an aggregation database. The system can include a user device having a first network interface and a first I/O subsystem. The system can include a server that can: provide content to the user device via a first electrical signal; receive a selection of a portion of the provided content from the user device via a second electrical signal; automatically extract sentences from the selected portion of the provided content via a natural language processor; automatically generate a parse tree for one of the automatically extracted sentences; identify noun phrases from the part of speech tags within the parse tree; place content associated with one of the noun phrase in a content aggregation; and output the content aggregation to the user device.Type: GrantFiled: December 13, 2016Date of Patent: August 13, 2019Assignee: PEARSON EDUCATION, INC.Inventors: Sean York, Tim Stewart, David Strong, Scott Hellman, William Murray
-
Patent number: 10372744Abstract: A computer scans a DITA library to identify DITA topic files. The computer then determines whether the identified DITA file has a concept, task, or reference scheme. Based on determining that the identified DITA topic file has a concept scheme, the computer generates a subject taxonomy. Based on determining that the identified DITA topic file has a task scheme, the computer generates a navigation taxonomy. Based on determining that the identified DITA topic file has a reference scheme, the computer generates a command relational taxonomy. Based on the generated subject, navigation, and command relational taxonomies, the computer generates a DITA file relationship table based on the contextual taxonomy density of the aforementioned taxonomies.Type: GrantFiled: June 3, 2016Date of Patent: August 6, 2019Assignee: International Business Machines CorporationInventors: Balaji S. Kumar, Vishal G. Palliyathu, Harpreet Singh
-
Patent number: 10354646Abstract: A third sentence obtained by replacing a first phrase of a first sentence with a second phrase is input, and it is judged whether a third phrase is included in a first database including at least a phrase used in written text. If the third phrase is not included, a first evaluation value in the first database is calculated for a seventh phrase obtained by replacing the second phrase of the third phrase with a sixth phrase. It is judged whether the third phrase is included in a second database including at least a phrase used in spoken text and whether a second evaluation value calculated from the first evaluation value satisfies a predetermined condition. If the third phrase is included, and the second evaluation value satisfies the predetermined condition, the third sentence and the second sentence as a pair are added to a bilingual corpus.Type: GrantFiled: August 29, 2017Date of Patent: July 16, 2019Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.Inventors: Nanami Fujiwara, Masaki Yamauchi, Masahiro Imade
-
Patent number: 10354010Abstract: An information processing system to increase weights of words that are related to a text, but that do not explicitly occur in the text, in a weight vector representing the text, is provided. An adjusting system (100) includes a distance storing unit (110) and an adjusting unit (120). The distance storing unit (110) stores distances between any two terms of a plurality of terms. The distance between two terms becomes smaller as the two terms are semantically more similar. The adjusting unit (120) adjusts a weight of each term of the plurality of terms in a weight vector including weights of the plurality of terms and representing a text, on the basis of a distance between each term and other term in the weight vector and a weight of the other term.Type: GrantFiled: April 24, 2015Date of Patent: July 16, 2019Assignee: NEC CORPORATIONInventors: Daniel Georg Andrade Silva, Akihiro Tamura, Masaaki Tsuchida
-
Patent number: 10339029Abstract: A method of detecting potential internationalization issues in source code may include installing a plug-in component in a stand-alone static source code analysis program/application that is configured to enable detection of internationalization issues in source code. The method may also include automatically creating a repository comprising a plurality of internationalization rules for a plurality of programming languages that are provided by the plug-in and accessing a subset of the plurality of internationalization rules corresponding to a particular programming language of the plurality of programming languages. The method may include creating a quality profile for the particular programming language using the subset of the plurality of internationalization rules, scanning source code of a software product for potential issues on a block level, and identifying and displaying the detected internationalization issues in the source code.Type: GrantFiled: November 1, 2016Date of Patent: July 2, 2019Assignee: CA, Inc.Inventors: Kumar Arnesh Rameshwar, Guru Prasadareddy Narapu Reddy
-
Patent number: 10331681Abstract: Implementations provide an improved system for presenting search results based on entity associations of the search items. An example method includes, for each of a plurality of crowdsource workers, initiating display of a first randomly selected cluster set from a plurality of cluster sets to the crowdsource worker. Each cluster set represents a different clustering algorithm applied to a set of search items responsive to a query. The method also includes receiving cluster ratings for the first cluster set from the crowdsource worker and calculating a cluster set score for the first cluster set based on the cluster ratings. This is repeated for remaining cluster sets in the plurality of cluster sets. The method also includes storing a cluster set definition for a highest scoring cluster set, associating the cluster set definition with the query, and using the definition to display search items responsive to the query.Type: GrantFiled: April 11, 2016Date of Patent: June 25, 2019Assignee: GOOGLE LLCInventors: Jilin Chen, Amy Xian Zhang, Sagar Jain, Lichan Hong, Ed Huai-Hsin Chi
-
Patent number: 10324971Abstract: A method for classifying a new instance including a text document by using training instances with class including labeled data and zero or more training instances with class including unlabeled data, comprising: estimating a word distribution for each class by using the labeled data and the unlabeled data; estimating a background distribution and a degree of interpolation between the background distribution and the word distribution by using the labeled data and the unlabeled data; calculating two probabilities for that the word generated from the word distribution and the word generated from the background distribution; combining the two probabilities by using the interpolation; combining the resulting probabilities of all words to estimate a document probability for the class that indicates the document is generated from the class; and classifying the new instance as a class for which the document probability is the highest.Type: GrantFiled: June 20, 2014Date of Patent: June 18, 2019Assignee: NEC CorporationInventors: Daniel Georg Andrade Silva, Hironori Mizuguchi, Kai Ishikawa
-
Patent number: 10325026Abstract: A technique for generating a new equivalent phrase for an input phrase includes receiving a first input phrase for natural language expansion. Tokens that correspond to parts of speech are generated for the first input phrase. An original grammar tree is generated using at least some of the tokens. One or more alternate grammar trees are generated that are different from the original grammar tree but substantially equivalent to the original grammar tree. One or more synonyms for at least one of the tokens are generated. Finally, one or more new phrases are generated based on the one or more alternate grammar trees and the one or more synonyms.Type: GrantFiled: September 25, 2015Date of Patent: June 18, 2019Assignee: International Business Machines CorporationInventor: Bryan D. Cardillo
-
Patent number: 10289667Abstract: Computer-program products and methods for automatically annotating terms, such as ambiguous terms, in an electronic text document are disclosed. In one embodiment, a method of annotating a text document includes determining, by a computing device, a term of interest within the text document. The method further includes searching a data structure including incongruous term pairs (tx, tt) determined from a controlled vocabulary for the term of interest appearing as a term tt, wherein the term tt is a linguistic head of a term tx of the incongruous term pairs (tx, tt). The method further includes annotating the term of interest with a meaning provided by the controlled vocabulary only if a term tx of the incongruous term pairs (tx, tt) associated with the term of interest in the data structure is not present within a predetermined textual distance of the term of interest in the text document.Type: GrantFiled: September 6, 2016Date of Patent: May 14, 2019Assignee: Elsevier B.V.Inventors: Marius Doornenbal, Inga Kohlhof
-
Patent number: 10275459Abstract: A content management system (CMS) and a translation management system (TMS) can utilize content dimensions for content items to manage and translate the content items between languages. Machine and human translations of complex dynamic content can also be improved by pre-rendering the content to remove localization-related syntax prior to machine or human translation. Content items can also be scored as to their suitability for localization prior to translation, and translation can be skipped for content items that do not have a sufficiently high score. Semantic and natural language processing (NLP) techniques can also be utilized for content categorization and routing. Translations of content items can also be continuously refined and higher quality re-translated content can be provided in an automated fashion.Type: GrantFiled: September 28, 2016Date of Patent: April 30, 2019Assignee: Amazon Technologies, Inc.Inventors: Paul Kasper, Pallami Bhattacharjee, Paul Christopher Cerda, William Joseph Kaper, Thibault Pierre Seillier, Kelly Duggar Wiggins
-
Patent number: 10275449Abstract: A computing device automatically creates a log record recognizer expression and uses the log record recognizer expression to identify a log record type for a log record to parse the log record. A log record type regular expression is selected from log record type regular expressions and is separated into subexpressions that are normalized and are reassembled into an expression recognizer for each log record type regular expression. The expression recognizer for each is read into a data structure. The recognizer expressions are sorted based on an order associated with an expression operator of each subexpression. A log recognizer expression is created from each read expression recognizer included in the sorted recognizer expressions. A log record type of a log record is identified using the created log recognizer expression. A log record type regular expression is selected. The log record is parsed using the selected log record type regular expression.Type: GrantFiled: October 3, 2018Date of Patent: April 30, 2019Assignee: SAS INSTITUTE INC.Inventors: Keefe Hayes, Robert N. Bonham
-
Patent number: 10270723Abstract: Systems, methods, and non-transitory computer-readable media can acquire a specified set of one or more entities associated with a user of a social networking system. A collection of content items provided by the specified set of one or more entities can be detected. One or more content items that are unread by the user can be identified out of the collection of content items. The one or more content items unread by the user can be sorted, in a chronological order, to produce a sorted set of one or more unread content items. An interface can be provided to the user for accessing the sorted set of one or more unread content items.Type: GrantFiled: March 23, 2015Date of Patent: April 23, 2019Assignee: Facebook, Inc.Inventors: Gregory Matthew Marra, Michael Novati, Zhiqiu Kong
-
Patent number: 10255300Abstract: Automatically extracting profile feature attribute data from event data is disclosed, including: receiving a set of event data; receiving a feature associated with a profiling technique; determining that a plurality of events associated with a user in the set of event data corresponds to a first attribute corresponding to the feature, wherein the first attribute corresponds to a first bin having a first defined value; determining that the plurality of events associated with the user in the set of event data corresponds to a second attribute corresponding to the feature, wherein the second attribute corresponds to a second bin having a second defined value; and creating a user record corresponding to the user indicating presence in the first bin and the second bin.Type: GrantFiled: May 14, 2015Date of Patent: April 9, 2019Assignee: Google LLCInventors: Anant Deep Jhingran, Krishna Kumar Kesavan, Joy Aloysius Thomas, Jagdish Chand, Sridhar Rajagopalan
-
Patent number: 10229108Abstract: A system and method for adaptive spell checking and correction. The method includes tracking frequencies of historical replacement strings of characters, and providing a list of “n” number of the historical replacement strings of characters in response to a string of characters which were previously changed or are not recognized.Type: GrantFiled: January 14, 2016Date of Patent: March 12, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: William K. Bodin, Gregory J. Boss, Rick A. Hamilton, II, John S. Langford
-
Patent number: 10216940Abstract: Methods, apparatuses, and computer program products are provided for truncated, encrypted searching of encrypted identifiers. A method may include receiving patient information associated with a plurality of patients and including a patient identifier of a sequence of characters for each of the plurality of patients. Methods may further include: extracting a first subset of the sequence of characters from each of the patient identifiers; encrypting the first subset of the sequence of characters from each of the patient identifiers to form a first truncated encrypted identifier for each of the plurality of patients; encrypting each of the patient identifiers to create an encrypted patient identifier for each of the plurality of patients; and storing the first truncated encrypted identifiers and the encrypted patient identifiers for each of the plurality of patients.Type: GrantFiled: March 27, 2015Date of Patent: February 26, 2019Assignee: CHANGE HEALTHCARE HOLDINGS, LLCInventor: Mike Zwinger
-
Patent number: 10198245Abstract: Systems and methods are provided for determining user interface controls associated with hierarchical levels during media content playback. In some embodiments, instructions may be provided to display user interface controls associated with a level of media content, such as a chapter level. A change in the level of media content and/or the user interface controls associated with the level may be a response to a level change request. A level change request may be initiated by user input and/or may automatically be initiated by the current content playback position.Type: GrantFiled: May 9, 2014Date of Patent: February 5, 2019Assignee: Audible, Inc.Inventors: Timothy Jaeger, Alison Mae Go
-
Patent number: 10186256Abstract: Typical speech recognition systems usually use speaker-specific speech data to apply speaker adaptation to models and parameters associated with the speech recognition system. Given that speaker-specific speech data may not be available to the speech recognition system, information indicative of language skills is employed in adapting configurations of a speech recognition system. According to at least one example embodiment, a method and corresponding apparatus, for speech recognition comprise maintaining information indicative of language skills of users of the speech recognition system. A configuration of the speech recognition system for a user is determined based at least in part on corresponding information indicative of language skills of the user. Upon receiving speech data from the user, the configuration of the speech recognition system determined is employed in performing speech recognition.Type: GrantFiled: January 23, 2014Date of Patent: January 22, 2019Assignee: Nuance Communications, Inc.Inventors: Weiying Li, Daniel Willett
-
Patent number: 10180940Abstract: A translation method is disclosed herein. The method includes determining a target object to be translated, the target object including a plurality of elements; dividing the target object to be translated according to a language correspondence relationship to obtain at least one element set; determining a weight value of a second object corresponding to each first object in each element set according to the language correspondence relationship; determining a comparison value associated with each element set according to the determined weight value and selecting an element set with the maximum comparison value; determining a second object with the maximum weight value corresponding to each first object in the selected element set according to the correspondence relationship, combining all the determined second objects to form a translation content of the target object.Type: GrantFiled: September 22, 2016Date of Patent: January 15, 2019Assignee: Alibaba Group Holding LimitedInventors: Hongfei Jiang, Jun Lu, Weihua Luo, Feng Lin
-
Patent number: 10176166Abstract: Mechanisms for identifying hidden meaning in a portion of natural language content are provided. A primary portion of natural language content is received and a secondary portion of natural language content is identified that references the natural language content. The secondary portion of natural language content is analyzed to identify indications of meaning directed to elements of the primary portion of natural language content. A probabilistic model is generated based on the secondary portion of natural language content modeling a probability of hidden meaning in the primary portion of natural language content. A hidden meaning statement data structure is generated for the primary portion of natural language content based on the probabilistic model.Type: GrantFiled: September 1, 2017Date of Patent: January 8, 2019Assignee: International Business Machines CorporationInventors: Donna K. Byron, Benjamin L. Johnson, Lakshminarayanan Krishnamurthy, Krishna Kummamuru, Timothy P. Winkler
-
Patent number: 10169454Abstract: A method for extracting a relations graph uses an ontology graph in which nodes represent entity classes or concepts and edges represent properties of the classes. A property is associated with a constraint which defines a range of values that can be taken without incurring a cost. Input text in which entity and concept mentions are identified is received. An optimal set of alignments between a subgraph of the ontology graph and the identified mentions is identified by optimizing a function of constraint costs incurred by the alignments and a distance measure computed over the set of alignments. The relations graph is generated, based on the optimal set of alignments. The relations graph represents a linked set of relations instantiating a subgraph of the ontology. The relations graph can include relations involving implicit mentions corresponding to subgraph nodes that are not aligned to any of the concept or entity mentions.Type: GrantFiled: May 17, 2016Date of Patent: January 1, 2019Assignee: XEROX CORPORATIONInventors: Salah Ait-Mokhtar, Vassilina Nikoulina
-
Patent number: 10169322Abstract: A method includes receiving, at a processor, a request to construct a word entry of a word. The method further includes collecting, by the processor, a user profile. The method further includes selecting, by the processor, one or more definition databases according to the user profile. The method further includes retrieving, by the processor, definitions of the word from the definition databases. The method further includes ranking, by the processor, the definitions retrieved from the definition databases.Type: GrantFiled: May 3, 2016Date of Patent: January 1, 2019Assignee: Dinky Labs, LLCInventors: Alan Rulin Liu, Gina Inan Liu
-
Patent number: 10146770Abstract: A mechanism is provided in a data processing system comprising at least one processor and at least one memory, the at least one memory comprising instructions executed by the at least one processor to cause the at least one processor to implement a cognitive system for capturing referential information. The cognitive system receives a first indication that a group text messaging conversation is in a muted state for a first user. The cognitive system detects a first use of a referential phrase in the group text messaging conversation during a first time period when the group text messaging conversation is in the muted state. The cognitive system receives a second indication that the group text messaging conversation is in a non-muted state. The cognitive system detects a second use of the referential phrase in the group text messaging conversation during a second time period when the group text messaging conversation is in the non-muted state. The second time period is subsequent to the first time period.Type: GrantFiled: December 1, 2016Date of Patent: December 4, 2018Assignee: International Business Machines CorporationInventors: Robert H. Grant, Jeremy A. Greenberger, Trudy L. Hewitt, Joseph Lam, Francesco C. Schembari
-
Patent number: 10147424Abstract: The present disclosure includes techniques for selecting a response to an audio stream query. In one embodiment, an application server receives an audio stream query including content spoken by a user interacting with a voice-user interface. The application server determines a set of paralinguistic features from the audio stream query, and estimates at least a first attribute of the user based on the set of paralinguistic features. The application server identifies subject matter corresponding to the spoken content in the audio stream query, and determines two or more query responses corresponding to the identified subject matter. The application server then selects one of the query responses to present to the user based, at least in part, on the attribute of the user estimated from the set of paralinguistic features.Type: GrantFiled: October 26, 2016Date of Patent: December 4, 2018Assignee: INTUIT INC.Inventors: Benjamin Indyk, Igor A. Podgorny, Raymond Chan
-
Patent number: 10133965Abstract: The invention refers to a method for text recognition, wherein the method is executed by a processor of a computing device and comprises steps of providing a confidence matrix, wherein the confidence matrix is a digital representation of an input sequence, entering a regular expression, searching for a symbol sequence of the input sequence that matches the regular expression, wherein a score value is computed by the processor using confidence values of the confidence matrix, wherein the score value is an indication of the quality of the matching between the symbol sequence of the input sequence and the regular expression. Further, the invention relates to a computer program product which when executed by a processor of a computing device performs the method.Type: GrantFiled: December 2, 2015Date of Patent: November 20, 2018Assignee: Planet A1 GbmHInventor: Welf Wustlich