Dictionary Building, Modification, Or Prioritization Patents (Class 704/10)
  • Patent number: 10324967
    Abstract: A system for performing semantic search receives an electronic text corpus and separates the text corpus into a plurality of sentences. The system parses and converts each sentence into a sentence tree. The system receives a search query and matches the search query with one or more of the sentence trees.
    Type: Grant
    Filed: November 16, 2017
    Date of Patent: June 18, 2019
    Assignee: Oracle International Corporation
    Inventors: Vladimir Zelevinsky, Yevgeniy Dashevsky, Diana Ye
  • Patent number: 10325215
    Abstract: Systems and methods for content aggregation creation are disclosed herein. The system can include memory having a content database and an aggregation database. The system can include a user device having a first network interface and a first I/O subsystem. The system can include a server that can: provide content to the user device via a first electrical signal; receive a selection of a portion of the provided content from the user device via a second electrical signal; automatically extract sentences from the selected portion of the provided content via a natural language processor; automatically generate a parse tree for one of the automatically extracted sentences; identify noun phrases from the part of speech tags within the parse tree; place content associated with one of the noun phrases in a content aggregation; and output the content aggregation to the user device.
    Type: Grant
    Filed: August 12, 2016
    Date of Patent: June 18, 2019
    Assignee: Pearson Education, Inc.
    Inventors: Sean York, Tim Stewart, David Strong, Scott Hellman, William Murray
  • Patent number: 10318996
    Abstract: An auction apparatus according to the application includes a plan receiving unit and an auction providing unit. The plan receiving unit receives plan information indicating a plan of content from a content creator who creates the content. The auction providing unit presents the plan information received by the plan receiving unit and provides an auction service in which bid targets are a right to display an advertisement in an advertising space that is set to the content and a right to replace the advertisement displayed in the advertising space.
    Type: Grant
    Filed: September 6, 2013
    Date of Patent: June 11, 2019
    Assignee: YAHOO JAPAN CORPORATION
    Inventors: Toru Takata, Nobuyuki Akashi, Akira Tajima, Koji Tsukamoto, Nari Woo, Akihiro Tanabe, Shinichiro Sega, Rin Katou
  • Patent number: 10319254
    Abstract: In one implementation, a computer-implemented method for providing a graphical user interface (GUI) that teaches correct placement of accent marks on Spanish language words through a particular sequence of graphical elements includes receiving, through a GUI presented by a computing device, a selection of a graphical word element corresponding to a word to test as part of a Spanish language teaching program; identifying, by the computing device, a particular word set that corresponds to the word from among a plurality of word sets; and outputting, in the GUI presented by the computing device and based on the particular word set, a first graphical prompt element for the user to identify whether the word has a particular ending characteristic that corresponds to the particular word set.
    Type: Grant
    Filed: June 15, 2016
    Date of Patent: June 11, 2019
    Inventor: Joel Lane Mayon
  • Patent number: 10311065
    Abstract: Scoring candidate evidence passages for criteria validation. Evidence data associated with a criteria, such that the evidence data entries include a decision indicator indicating that the criteria is either met or not met by the evidence data, is collected. Candidate evidences, making up a corpus of data associated with the criteria, against which the criteria is to be validated, are generated. Each candidate evidence is evaluated against the evidence data. A score indicating the validity of the criteria with respect to the candidate evidence is generated, based on the decision indicators associated with the evidence data entries.
    Type: Grant
    Filed: December 1, 2015
    Date of Patent: June 4, 2019
    Assignee: International Business Machines Corporation
    Inventors: Lalit Agarwalla, Ankur Parikh, Avinesh Polisetty Venkata Sai
  • Patent number: 10311867
    Abstract: According to an embodiment, a tagging support apparatus includes a first acquirer, an estimator, a first storage, a second acquirer, and a presenter. The first acquirer acquires a spoken sentence concerning an utterance of a user. The estimator estimates an utterance intention of the spoken sentence to obtain one or more intention candidates of the utterance intention. The first storage stores an intention system having a hierarchical structure of intentions used in a dialog system. The second acquirer acquires, based on the intention candidates, part of the intention system as one or more hierarchical intention candidates. The presenter presents the hierarchical intention candidates.
    Type: Grant
    Filed: January 31, 2017
    Date of Patent: June 4, 2019
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Masahiro Ito, Kenji Iwata
  • Patent number: 10289653
    Abstract: A system and computer program product for adapting tabular data for narration are provided in the illustrative embodiments. A set of categories used to organize data is identified in a first tabular portion of a document. A structure of the categories is analyzed. An inference is drawn about data in a first cell in the first tabular portion based on a position of the first cell in the structure. The first tabular portion of the document is transformed into a first narrative form using the inference.
    Type: Grant
    Filed: March 15, 2013
    Date of Patent: May 14, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Donna K. Byron, Alexander Pikovsky, Matthew B. Sanchez
  • Patent number: 10289680
    Abstract: Systems and methods of natural language processing in an environment with no existing corpus are disclosed. The method includes defining an input grammar specific to a chosen domain, the input grammar having domain-specific knowledge and general grammatical knowledge. Groups of tokens are identified within the input grammar having syntactic and semantic equivalence. The identified groups are assembled into hypernyms, wherein the hypernyms include a semantic output for each token in the hypernyms. A list of fields is then combined with the hypernyms. A corpus of possible combinations of hypernyms and fields is created. A data structure mapping each possible combination to a partial semantic output is generated and the data structure is saved for use in later processing.
    Type: Grant
    Filed: May 31, 2016
    Date of Patent: May 14, 2019
    Assignee: Oath Inc.
    Inventors: Jonathan R. Scally, Richard Caneba, Nick Cassimatis
  • Patent number: 10282415
    Abstract: Aspects of the present disclosure include a system comprising a machine-readable storage medium storing at least one program and computer-implemented methods for detecting a language of a text string. Consistent with some embodiments, the method may include applying multiple language identification models to a text string. Each language identification model provides a predicted language of the text string and a confidence score associated with the predicted language. The method may further include weighting each associated confidence score based on historical performance of the corresponding language identification model in predicting languages of other text strings. The method may further include selecting a predicted language of the text string from among the multiple predicted languages provided by the multiple language identification models based on a result of the weighting of the confidence score associated with the particular predicted language.
    Type: Grant
    Filed: November 29, 2016
    Date of Patent: May 7, 2019
    Assignee: eBay Inc.
    Inventors: Akshay Gupta, Hrishikesh Joshi, Saiyam Kohli, Vidit Aggarwal
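The weighting step described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the model names, the accuracy figures, and the rule of simply multiplying each confidence by a historical accuracy are all assumptions made for illustration.

```python
def select_language(predictions, historical_accuracy, default_weight=0.5):
    """Pick a language from several model outputs.

    predictions: list of (model_name, language, confidence) tuples.
    historical_accuracy: hypothetical map of model_name -> past accuracy in [0, 1].
    Returns (language, weighted_score) for the highest weighted score.
    """
    weighted = []
    for model, language, confidence in predictions:
        # Assumption: weight = past accuracy of the model that made the prediction.
        weight = historical_accuracy.get(model, default_weight)
        weighted.append((confidence * weight, language))
    score, language = max(weighted, key=lambda pair: pair[0])
    return language, score


# Hypothetical models: the more confident one has the weaker track record.
predictions = [("ngram_model", "es", 0.9), ("dictionary_model", "pt", 0.8)]
accuracy = {"ngram_model": 0.6, "dictionary_model": 0.9}
language, score = select_language(predictions, accuracy)
print(language)
```

Here the raw confidences favor "es", but once each score is scaled by its model's track record the selection flips to "pt".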
  • Patent number: 10282470
    Abstract: This disclosure describes a distributed reader architecture for a mobile computing device such as a cellular telephone handset.
    Type: Grant
    Filed: December 8, 2017
    Date of Patent: May 7, 2019
    Assignee: Digimarc Corporation
    Inventors: Ravi K. Sharma, Tony F. Rodriguez
  • Patent number: 10268828
    Abstract: An encryption platform may identify language characters in database fields and determine which of multiple encoding schemes more efficiently encodes the language characters. The encryption platform may use the selected encoding scheme to encode plaintext into a byte array and then encrypt the byte array into ciphertext. The encryption platform may assign encoding flags to the ciphertext identifying the encoding scheme used for encoding the plaintext. The encryption platform then may use the encoding flag to determine which encoding schemes to use for decoding the data back into plaintext. Using the different encoding schemes allows encrypted data to fit better into database storage structures. The encryption platform also may enable users to select different fields for different selectable objects for encoding and encrypting.
    Type: Grant
    Filed: April 4, 2018
    Date of Patent: April 23, 2019
    Assignee: SALESFORCE.COM, INC.
    Inventor: Patrick Calahan
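As a rough illustration of choosing the more compact encoding and recording it in a flag, the sketch below compares UTF-8 with UTF-16 and keeps whichever yields fewer bytes. The two candidate encodings, the flag values, and the function names are assumptions; the abstract does not name them, and the encryption step that would sit between encoding and storage is omitted here.

```python
# Hypothetical flag values; the abstract does not specify the actual schemes.
ENCODINGS = {0: "utf-8", 1: "utf-16-le"}


def encode_compact(plaintext):
    """Encode with whichever candidate scheme produces the fewest bytes.

    Returns (flag, byte_array). In the described platform the byte array
    would be encrypted next; that step is left out of this sketch.
    """
    candidates = {flag: plaintext.encode(name) for flag, name in ENCODINGS.items()}
    # Ties go to the lower flag so the choice is deterministic.
    flag = min(candidates, key=lambda f: (len(candidates[f]), f))
    return flag, candidates[flag]


def decode_compact(flag, data):
    """Use the stored flag to pick the matching decoder."""
    return data.decode(ENCODINGS[flag])


flag_latin, bytes_latin = encode_compact("hello")
flag_cjk, bytes_cjk = encode_compact("日本語")
print(flag_latin, len(bytes_latin), flag_cjk, len(bytes_cjk))
```

Latin text is shorter in UTF-8, while many CJK characters take three bytes in UTF-8 but two in UTF-16, which is why a per-value choice can help data fit fixed-size database fields.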
  • Patent number: 10268756
    Abstract: An approach is provided for classifying natural language input into a plurality of query processing levels. The approach involves parsing at least one natural language input into one or more words, wherein the at least one natural language input represents at least one cartographic query. The approach also involves mapping the one or more words to at least one function of a cartography system. The approach further involves determining an availability of the at least one function, one or more parameters associated with the at least one function, or a combination thereof with respect to at least one component of the cartography system. The approach also involves classifying the at least one cartographic query into at least one of a plurality of query processing levels based, at least in part, on the availability.
    Type: Grant
    Filed: December 18, 2015
    Date of Patent: April 23, 2019
    Assignee: HERE Global B.V.
    Inventor: Volodimir Burlik
  • Patent number: 10255275
    Abstract: Implementations herein relate to methods and devices for generating candidate translations and for quantizing text as well as words. A method may include generating, by a computing device, pending candidate translations of text to be translated based on predetermined translation rules. The computing device may generate translation probabilities from the text to be translated to the pending candidate translations based on features having impacts on translation probabilities of the pending candidate translations and a predetermined translation probability prediction model. The computing device may then select a predetermined number of pending candidate translations that have the translation probabilities higher than other pending candidate translations in the pending candidate translations to be the candidate translations of the text to be translated.
    Type: Grant
    Filed: August 25, 2016
    Date of Patent: April 9, 2019
    Assignee: Alibaba Group Holding Limited
    Inventors: Kai Song, Feng Lin, Weihua Luo
  • Patent number: 10255001
    Abstract: Disclosed is a method of provisioning electronic forms based on natural language. The method includes receiving, using a communication device, a natural language input from a builder device, wherein the natural language input represents one or both of a requested data and a presented data associated with a legal process. Further, the method includes analyzing, using a processing device, the natural language input. Yet further, the method includes generating, using the processing device, an electronic form based on the analyzing of the natural language input, wherein the electronic form comprises an input field configured to receive the requested data and an output field configured to present the presented data. Moreover, the method includes storing, using a storage device, the electronic form. Furthermore, the method includes transmitting, using the communication device, the electronic form to one or more user devices.
    Type: Grant
    Filed: June 6, 2017
    Date of Patent: April 9, 2019
    Inventor: Morgan Warstler
  • Patent number: 10255266
    Abstract: A relay apparatus includes a characteristic information storage unit, a first dictionary information storage unit, a first selection unit, and a first conversion unit. The characteristic information storage unit stores pieces of terminal information in association with respective pieces of characteristic information, each indicating a characteristic of a user. The first dictionary information storage unit stores the pieces of characteristic information in association with respective pieces of first dictionary information used for converting voice data into text data. The first selection unit selects, when voice data is received from a second communication terminal to communicate with a first communication terminal, the characteristic information associated with the terminal information for identifying the second communication terminal and selects the first dictionary information associated with the selected characteristic information.
    Type: Grant
    Filed: December 3, 2014
    Date of Patent: April 9, 2019
    Assignee: RICOH COMPANY, LIMITED
    Inventor: Tomohide Takano
  • Patent number: 10249297
    Abstract: Examples of the present disclosure describe processing by an input understanding system/service. A received input is processed to generate a set of alternatives for recognizing the received input. The set of alternatives is filtered. Filtering comprises ranking the set of alternatives and propagating a plurality of the ranked alternatives for additional processing. The propagated alternatives are processed to generate an expanded set of alternatives for potential hypotheses based on the received input. The expanded set of alternatives is filtered. Filtering comprises ranking alternatives of the expanded set and propagating a plurality of the ranked alternatives of the expanded set for additional processing. The propagated alternatives of the expanded set are evaluated based on application of knowledge data fetched from external resources. A response to the received input is generated.
    Type: Grant
    Filed: July 13, 2015
    Date of Patent: April 2, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Omar Zia Khan, Ruhi Sarikaya
  • Patent number: 10248727
    Abstract: A system and method for providing a message-based word search service are disclosed. The system includes a database that is equipped with a plurality of data modules and registers sentence data, and words extracted from the sentence data, in each data module so that the sentence data and the words can be managed; and a service server that, on receiving a search word from a user terminal, broadcasts the search word to all of the data modules, receives a search result from each data module in response to the broadcast, and then provides the search results to the user terminal.
    Type: Grant
    Filed: August 4, 2014
    Date of Patent: April 2, 2019
    Inventor: Young Hyun Bae
  • Patent number: 10250538
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, to facilitate interception of messages that include offensive content. In one aspect, a method includes actions of receiving input on a user device that includes message content, determining, on the user device, whether the message content includes offensive content, and in response to determining, on the user device, that the message content includes offensive content, generating an alert message for display on the user device that provides an indication that the message includes offensive content.
    Type: Grant
    Filed: June 20, 2016
    Date of Patent: April 2, 2019
    Inventor: Trisha N. Prabhu
  • Patent number: 10248722
    Abstract: A system with methodology for providing multi-language support for dynamic ontology. In one embodiment, for example, a method comprises: storing an ontology for a data store, wherein the ontology comprises a plurality of data types; for each data type of the plurality of data types: storing, in the ontology, one or more display values, wherein each display value is associated with a locale; determining a locale identifier corresponding to a particular locale; selecting a particular display value corresponding to the particular locale; displaying the particular display value.
    Type: Grant
    Filed: February 22, 2016
    Date of Patent: April 2, 2019
    Assignee: PALANTIR TECHNOLOGIES INC.
    Inventors: David Leblanc, John Butters, Chris Cook
  • Patent number: 10235355
    Abstract: An input method editor (IME) is associated with a local user. Memory stores local data and a processor, coupled to the memory, is configured to receive input from a local, first user, obtain shared data associated with at least a remote, second user from a remote server and generate prediction candidates and conversion candidates based on the input provided by the local, first user and correlation of the input and the obtained shared data.
    Type: Grant
    Filed: July 31, 2017
    Date of Patent: March 19, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dong Li, Xi Chen, Yoshiharu Sato, Keita Ooi
  • Patent number: 10235461
    Abstract: Systems and methods are provided for identifying relevant information for an entity, referred to as a seed entity. A plurality of search queries can be generated each comprising a property of a seed entity or one of the entities associated with the seed entity (seed-linked entities). Preferably, a collection of search queries includes ones representing different properties of the seed entity and properties of different seed-linked entities. Optionally, the collection of search queries is optimized to reduce search burden. Searches can then be conducted with the search queries in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit entity and one or more entities associated with the hit entity (hit-linked entity).
    Type: Grant
    Filed: May 2, 2017
    Date of Patent: March 19, 2019
    Assignee: Palantir Technologies Inc.
    Inventors: Matthew Elkherj, Ashley Einspahr, Breanna Bunge, Chris Hammett, Erika Crawford Tom, Mitchell Beard, Ryan Beiermeister, Seelig Sinton, Sharon Hao, William Ayers, Seth Robinson
  • Patent number: 10229674
    Abstract: Technologies are described herein for cross-language speech recognition and translation. An example method of speech recognition and translation includes receiving an input utterance in a first language, the input utterance having at least one name of a named entity included therein and being pronounced in a second language, utilizing a customized language model to process at least a portion of the input utterance, and identifying the at least one name of the named entity from the input utterance utilizing a phonetic representation of the at least one name of the named entity. The phonetic representation has a pronunciation of the at least one name in the second language.
    Type: Grant
    Filed: May 15, 2015
    Date of Patent: March 12, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Arul A. Menezes, Hany M. Hassan Awadalla
  • Patent number: 10223067
    Abstract: An environmental context of a user may be taken into account to enhance the user's communication throughput. An “environmental context” can include the spatial surroundings of a user, device, and/or sensor of the device and, more broadly, can denote the context of the user in a multiplicity of environments such as, for example, the surroundings of a user, a digital environment such as the user's or other individuals' interactions with or made near a device, etc. The techniques can include obtaining contextual data to provide context-predicted suggestions of words and/or phrases that a user can select to be output on the user's behalf. In some examples, the techniques can also use contextual data to weight, sort, rank, and/or filter word and/or phrase suggestions.
    Type: Grant
    Filed: July 15, 2016
    Date of Patent: March 5, 2019
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Meredith Morris, Shaun K. Kane
  • Patent number: 10216731
    Abstract: A system and method for translating data from a source language to a target language are provided, wherein a machine-generated target translation of a source sentence is compared to a database of human-generated target sentences. If a matching human-generated target sentence is found, the human-generated target sentence may be used instead of the machine-generated sentence, since the human-generated target sentence is more likely to be a well-formed sentence than the machine-generated sentence. The system and method do not rely on a translation memory containing pairs of sentences in both source and target languages, and minimize the reliance on a human translator to correct a translation generated by machine translation.
    Type: Grant
    Filed: January 19, 2017
    Date of Patent: February 26, 2019
    Assignee: SDL Inc.
    Inventors: Shang-Che Cheng, Alexander Pressman, Hong Zhang, Pei Chiang Ma, Shuan Zhang, Jochen Hummel
  • Patent number: 10216832
    Abstract: A natural language processing system has a hierarchy of user intents related to a domain of interest, the hierarchy having specific intents corresponding to leaf nodes of the hierarchy, and more general intents corresponding to ancestor nodes of the leaf nodes. The system also has a trained understanding model that can classify natural language utterances according to user intent. When the understanding model cannot determine with sufficient confidence that a natural language utterance corresponds to one of the specific intents, the natural language processing system traverses the hierarchy of intents to find a more general user intent that is related to the most applicable specific intent of the utterance and for which there is sufficient confidence. The general intent can then be used to prompt the user with questions applicable to the general intent to obtain the missing information needed for a specific intent.
    Type: Grant
    Filed: December 19, 2016
    Date of Patent: February 26, 2019
    Assignee: Interactions LLC
    Inventors: Srinivas Bangalore, John Chen
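The back-off from a specific intent to a more general one can be sketched as follows. This is only an illustration under stated assumptions: the intent names and hierarchy are hypothetical, the classifier is replaced by a plain dictionary of leaf scores, and a general intent's confidence is assumed to be the sum of the scores of the leaves beneath it.

```python
# Hypothetical intent hierarchy, stored as child -> parent.
PARENT = {
    "book_flight": "travel",
    "book_hotel": "travel",
    "check_balance": "banking",
    "travel": "root",
    "banking": "root",
}


def ancestors(node):
    """Yield the ancestors of a node, nearest first."""
    while node in PARENT:
        node = PARENT[node]
        yield node


def node_confidence(node, leaf_scores):
    """Assumption: a general intent's confidence is the sum of its leaves' scores."""
    return sum(
        score
        for leaf, score in leaf_scores.items()
        if leaf == node or node in ancestors(leaf)
    )


def resolve_intent(leaf_scores, threshold=0.7):
    """Return the most specific intent whose confidence reaches the threshold."""
    best_leaf = max(leaf_scores, key=leaf_scores.get)
    if leaf_scores[best_leaf] >= threshold:
        return best_leaf
    # Walk up from the most applicable specific intent.
    for general in ancestors(best_leaf):
        if node_confidence(general, leaf_scores) >= threshold:
            return general
    return None


scores = {"book_flight": 0.45, "book_hotel": 0.40, "check_balance": 0.15}
print(resolve_intent(scores))
```

With these scores neither travel-related leaf is confident enough on its own, so the sketch settles on the general "travel" intent, at which point a dialog system could ask whether the user wants a flight or a hotel.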
  • Patent number: 10192543
    Abstract: A method (300) and system (100) are provided to add the creation of examples at a developer level in the generation of Natural Language Understanding (NLU) models, tying the examples into an NLU sentence database (130), automatically validating (310) a correct outcome of using the examples, and automatically resolving (316) problems the user has using the examples. The method (300) can convey examples of what a caller can say to a Natural Language Understanding (NLU) application. The method includes entering at least one example associated with an existing routing destination, and ensuring an NLU model correctly interprets the example unambiguously for correctly routing a call to the routing destination. The method can include presenting the example sentence in a help message (126) within an NLU dialogue as an example of what a caller can say for connecting the caller to a desired routing destination.
    Type: Grant
    Filed: May 10, 2016
    Date of Patent: January 29, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Rajesh Balchandran, Linda M. Boyer, James R. Lewis, Brent D. Metz
  • Patent number: 10191903
    Abstract: A user context generator determines one or both of a location of a user and contextual information for the user. The contextual information is indicative of content of interest to the user. A custom content generator engine generates customized translated content for the user. Generating the customized translated content includes selecting, from translated content stored in a database, based on the one or both of the determined location of the user and the determined contextual information for the user, translated content to be presented to the user. The customized translated content includes a set of phrases in a source language and corresponding translations of phrases, in the set of phrases, from the source language to a target language. The selected translated content is displayed to the user, such that the user is provided with translated content of interest to the user.
    Type: Grant
    Filed: September 30, 2016
    Date of Patent: January 29, 2019
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: William Duncan Lewis, Vishal Chandulal Chowdhary, Tanvi Saumil Surti
  • Patent number: 10191942
    Abstract: A token-based database management system described herein may reduce an amount of comparisons during entity resolution of records. The system includes a token creator configured to create tokens from records, a token-record mapping creator configured to create a token-record mapping of tokens to records, a token importance calculator configured to calculate token importance values for the tokens, a token pruner configured to identify a token of the current record as unimportant based on token importance values of the tokens of the current record, and to remove the unimportant token from the token-record mapping, a record selector configured to select only records sharing at least one common token with the current record, and a record comparator configured to compare the current record with each of the selected records to determine whether the current record matches any of the selected records.
    Type: Grant
    Filed: October 14, 2016
    Date of Patent: January 29, 2019
    Assignee: SAP SE
    Inventors: Tim Parker, Jim Havlicek
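The token-based candidate selection in this abstract can be sketched in a few lines of Python. The whitespace tokenizer, the sample records, and the importance rule (a token shared by more than half of all records counts as unimportant) are assumptions for illustration. The sketch also skips unimportant tokens at lookup time rather than deleting them from the mapping as the abstract describes.

```python
from collections import defaultdict


def tokenize(record):
    """Assumption: tokens are lowercased whitespace-separated words."""
    return set(record.lower().split())


def build_token_index(records):
    """Create the token -> record-id mapping."""
    index = defaultdict(set)
    for record_id, text in records.items():
        for token in tokenize(text):
            index[token].add(record_id)
    return index


def candidate_records(current_id, records, index, max_share=0.5):
    """Select only records sharing at least one important token with the current record."""
    total = len(records)
    selected = set()
    for token in tokenize(records[current_id]):
        # Hypothetical importance rule: very common tokens carry little signal.
        if len(index[token]) / total > max_share:
            continue
        selected |= index[token]
    selected.discard(current_id)
    return selected


records = {
    1: "Acme Corp Berlin",
    2: "ACME Corporation Berlin",
    3: "Globex Corp Paris",
    4: "Initech Corp London",
}
index = build_token_index(records)
print(candidate_records(1, records, index))
```

Because "corp" appears in three of the four records it is ignored, so record 1 is compared only against record 2 instead of against every other record.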
  • Patent number: 10191946
    Abstract: According to one exemplary embodiment, a method for finding an answer to a query from a table is provided. The method may include receiving the query and the table. The method may also include generating a hierarchical representation based on the received table, whereby the hierarchical representation comprises a primary tree and a secondary tree. The method may then include mapping the primary tree to the secondary tree. The method may further include generating a plurality of predicate triples in a semantic relationship form based on the primary tree and the secondary tree. The method may also include determining if a predicate triple within the plurality of predicate triples matches the query using query-side and table-side context and semantics. The method may then include adding the predicate triple within the plurality of predicate triples to a plurality of candidate answers based on determining that the predicate triple matches the query.
    Type: Grant
    Filed: March 11, 2015
    Date of Patent: January 29, 2019
    Assignee: International Business Machines Corporation
    Inventors: Chinnappa Guggilla, Prabhakar R. Majjiga, Praveen K. Midde, Avinesh Polisetty Venkata Sai
  • Patent number: 10176165
    Abstract: Disambiguation in mention detection. The method includes: determining at least one location in a text at which a target surface form in the text appears; obtaining an overall word-bag context of the target surface form in the text, the word-bag context at each of the at least one location including words within a predetermined neighborhood of the location; obtaining an overall resource context of the target surface form in the text, the resource context at each of the at least one location including resources corresponding to a further surface form within a predetermined neighborhood of the location; and determining a similarity between the target surface form and a candidate resource for the target surface form based on the overall word-bag context and the overall resource context. A system for disambiguation in mention detection is also provided.
    Type: Grant
    Filed: October 29, 2015
    Date of Patent: January 8, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Feng Cao, Yuan Ni, Noam Slonim, Qiongkai Xu, Hui Jia Zhu
  • Patent number: 10176161
    Abstract: A computer-implemented method includes determining one or more information artifacts and determining at least one of one or more input properties or one or more content properties. The one or more input properties and the one or more content properties are each associated with the one or more information artifacts. The computer-implemented method further includes determining whether the one or more input information artifacts comprise one or more emotional indications based on at least one of the one or more input properties or the one or more content properties. The computer-implemented method further includes determining whether to perform one or more responsive actions based on the one or more emotional indications. A corresponding computer program product and computer system are also disclosed.
    Type: Grant
    Filed: January 28, 2016
    Date of Patent: January 8, 2019
    Assignee: International Business Machines Corporation
    Inventors: Ilse M. Breedvelt-Schouten, Jana H. Jenkins, Jeffrey A. Kusnitz, John A. Lyons
  • Patent number: 10176256
    Abstract: In accordance with one embodiment, a method can be implemented that comprises receiving as an input a title of a video from a video sharing web site; parsing the title of the video into one or more n-grams; computing with a computer a title-searchability-score by utilizing the one or more n-grams.
    Type: Grant
    Filed: December 22, 2014
    Date of Patent: January 8, 2019
    Assignee: BROADBANDTV, Corp
    Inventors: Mehrdad Fatourechi, Lino E. Coria Mendoza, Shahrzad Rafati
  • Patent number: 10175968
    Abstract: A first computing entity receives a respective set of software discovery rules from each of a plurality of other computing entities. Based on each of the sets of software discovery rules, a set of software discovery rules associated with the first computing entity is modified. The modified set of software discovery rules is communicated to each of the plurality of other computing entities.
    Type: Grant
    Filed: December 20, 2012
    Date of Patent: January 8, 2019
    Assignee: International Business Machines Corporation
    Inventors: Piotr Kania, Tomasz Stopa
  • Patent number: 10170014
    Abstract: A computer-implemented method for creating question-answer pairs is provided. The computer-implemented method includes leveraging domain-specific resources including at least one or more of lexicons, glossaries, or knowledge bases for constructing templates for creating the question-answer pairs. The computer-implemented method further includes leveraging user experiences of a plurality of users for constructing templates. The computer-implemented method further includes eliminating erroneous question-answer pairs based on template specifications of a heuristic process of the constructed templates.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: January 1, 2019
    Assignee: International Business Machines Corporation
    Inventors: William A. Beason, Swaminathan Chandrasekaran, Anne E. Gattiker, Lakshminarayanan Krishnamurthy, Sridhar Sudarsan
  • Patent number: 10171675
    Abstract: A virtual assistant application, system, and service that interacts with a caller over a telephone, cellular, or VoIP network as the called party's representative when the called party is unavailable, providing answers to queries and performing commands that were provisioned by the called party. The application includes a telephony interface, speech recognition and synthesis, a dialog manager, and knowledge domain. The application can also be configured to interact with a sender of SMS, MMS, or IM messages.
    Type: Grant
    Filed: October 10, 2016
    Date of Patent: January 1, 2019
    Assignee: Open Invention Network, LLC
    Inventor: Gerald McCobb
  • Patent number: 10170103
    Abstract: A method, a system, and a computer program product are provided for discriminatively training a feature-space transform. The method includes performing feature-space discriminative training (f-DT) on an initialized feature-space transform, using manually transcribed data, to obtain a pre-stage trained feature-space transform. The method further includes performing f-DT on the pre-stage trained feature-space transform as a newly initialized feature-space transform, using automatically transcribed data, to obtain a main-stage trained feature-space transform. The method additionally includes performing f-DT on the main-stage trained feature-space transform as a newly initialized feature-space transform, using manually transcribed data, to obtain a post-stage trained feature-space transform.
    Type: Grant
    Filed: January 22, 2016
    Date of Patent: January 1, 2019
    Assignee: International Business Machines Corporation
    Inventor: Takashi Fukuda
  • Patent number: 10169329
    Abstract: Systems and processes for exemplar-based natural language processing are provided. In one example process, a first text phrase can be received. It can be determined whether editing the first text phrase to match a second text phrase requires one or more of inserting, deleting, and substituting a word of the first text phrase. In response to determining that editing the first text phrase to match the second text phrase requires one or more of inserting, deleting, and substituting a word of the first text phrase, one or more of an insertion cost, a deletion cost, and a substitution cost can be determined. A semantic edit distance between the first text phrase and the second text phrase in a semantic space can be determined based on one or more of the insertion cost, the deletion cost, and the substitution cost.
    Type: Grant
    Filed: July 26, 2016
    Date of Patent: January 1, 2019
    Assignee: Apple Inc.
    Inventors: Richard L. Futrell, Thomas R. Gruber
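    Illustration: a minimal sketch of a word-level edit distance built from insertion, deletion, and substitution costs, as outlined in the abstract above. The abstract describes costs determined in a semantic space but does not give them, so the cost functions here (`ins_cost`, `del_cost`, `sub_cost`) are hypothetical callables supplied by the caller, and whitespace tokenization is an assumption.

```python
def edit_distance(first, second, ins_cost, del_cost, sub_cost):
    """Minimum total cost of editing phrase `first` into phrase `second`, word by word."""
    a, b = first.split(), second.split()
    # d[i][j] holds the minimum cost of turning a[:i] into b[:j].
    d = [[0.0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(1, len(a) + 1):
        d[i][0] = d[i - 1][0] + del_cost(a[i - 1])
    for j in range(1, len(b) + 1):
        d[0][j] = d[0][j - 1] + ins_cost(b[j - 1])
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            # Identical words cost nothing to keep.
            same = a[i - 1] == b[j - 1]
            d[i][j] = min(
                d[i - 1][j] + del_cost(a[i - 1]),
                d[i][j - 1] + ins_cost(b[j - 1]),
                d[i - 1][j - 1] + (0.0 if same else sub_cost(a[i - 1], b[j - 1])),
            )
    return d[len(a)][len(b)]
```

    With every cost fixed at 1.0 this reduces to ordinary Levenshtein distance over words; a semantic variant would instead return a small substitution cost for closely related words.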
  • Patent number: 10169328
    Abstract: A mechanism is provided in a data processing system for identifying nonsense passages. The mechanism annotates an input passage with linguistic features to form an annotated passage. The mechanism counts a number of instances of each type of linguistic feature in the annotated passage to form a set of feature counts. The mechanism determines a value for a metric based on the set of feature counts and compares the value for the metric to a predetermined model threshold. The mechanism identifies whether the input passage is a nonsense passage based on a result of the comparison.
    Type: Grant
    Filed: May 12, 2016
    Date of Patent: January 1, 2019
    Assignee: International Business Machines Corporation
    Inventors: Charles E. Beller, Michael Drzewucki, Christopher Phipps, Kristen M. Summers, Julie T. Yu
  • Patent number: 10169315
    Abstract: A neural network may be used to remove personal information from text (such as names, addresses, credit card numbers, or social security numbers), and replace the personal information with a label indicating the type or class of the removed information. The neural network may comprise multiple layers that compute a context vector for words of the text, compute label scores for words of the text using the context vectors, and select a label for each word using the label scores. Words corresponding to certain labels may be replaced with a label, such as replacing the digits of a credit card number with a label <cc_number>. The redacted text may then be presented to a person or stored for later processing.
    Type: Grant
    Filed: April 27, 2018
    Date of Patent: January 1, 2019
    Assignee: ASAPP, INC.
    Inventors: Frederick William Poe Heckel, Shawn Henry
  • Patent number: 10162813
    Abstract: In language evaluation systems, user expressions are often evaluated by speech recognizers and language parsers, and among several possible translations, a highest-probability translation is selected and added to a dialog sequence. However, such systems may exhibit inadequacies by discarding alternative translations that may initially exhibit a lower probability, but that may have a higher probability when evaluated in the full context of the dialog, including subsequent expressions. Presented herein are techniques for communicating with a user by formulating a dialog hypothesis set identifying hypothesis probabilities for a set of dialog hypotheses, using generative and/or discriminative models, and repeatedly re-ranking the dialog hypotheses based on subsequent expressions.
    Type: Grant
    Filed: November 21, 2013
    Date of Patent: December 25, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ruhi Sarikaya, Daniel Boies, Paul A. Crook, Jean-Philippe Robichaud
  • Patent number: 10162613
    Abstract: A method and system are provided including a parser module; a display; a memory storing processor-executable process steps; and a parser processor coupled to the memory, and in communication with the parser module and operative to execute the processor-executable steps to cause the system to: provide a user interface to a user via the display, the user interface including one or more data-entry fields; receive an expression in the one or more data-entry fields; transform the expression into a tree structure, wherein the tree structure is usable by two or more different runtime engines; and output the tree structure. Numerous other aspects are provided.
    Type: Grant
    Filed: July 18, 2017
    Date of Patent: December 25, 2018
    Assignee: SAP PORTALS ISRAEL LTD.
    Inventors: Efrat Vilozny, Anita Demayo, Ronen Halbani
  • Patent number: 10157221
    Abstract: A processor expands a search expression. The processor determines nodes representing query terms of a search expression. The nodes have associated text for search term expansion, and represent at least one concept in a semantic graph of nodes that represents a domain of semantically related concepts. The processor determines i) a center of focus within the semantic graph for the two or more nodes based, at least in part, on a spreading activation in the graph and ii) a contextual relevance for the two or more nodes with respect to the center of focus. The processor selects, for a query term, a node based on contextual relevance between that node and the query term and expands the search expression using an associated text of that node.
    Type: Grant
    Filed: September 25, 2015
    Date of Patent: December 18, 2018
    Assignee: International Business Machines Corporation
    Inventors: Seamus R. Mac an tSaoir, Daniel J. McCloskey, Ahmed M. M. R. Salem, Mikhail Sogrin
  • Patent number: 10157201
    Abstract: A method of searching for and providing information about a natural language query having a simple or complex sentence structure, includes: generating a mashup query language having a tree structure in a plurality of levels based on at least one query entity included in a natural language query language via a semantic analysis of the natural language query language; determining whether the plurality of levels are linked through a query entity forming each of the plurality of levels based on attribute information of the mashup query language; searching for data corresponding to the query entity forming each of the plurality of levels from a knowledge database based on a result of the determining, and deriving main information and at least one piece of entity information corresponding to the natural language query language from found data; and laying out a search result screen including the main information and the at least one piece of entity information.
    Type: Grant
    Filed: July 10, 2015
    Date of Patent: December 18, 2018
    Assignee: NAVER Corporation
    Inventors: Jae Hyeok Chang, Ki Young Kim, Myeong Seok Kim, Ji Hye Choi, Won Jin Lee, Hyun Ah Lee, Yong Hun Lee
  • Patent number: 10157223
    Abstract: A device may obtain text to be processed to identify a trend associated with a topic included in the text. The text may include a plurality of text sections, associated with the topic, that may be associated with a plurality of temporal intervals. The device may determine a respective context for the topic in each of the plurality of text sections. The device may calculate a first specificity score based on the respective context for the topic for one or more text sections associated with the first temporal interval. The device may calculate a second specificity score based on the respective context for the topic for one or more text sections associated with the second temporal interval. The device may identify a trend associated with the topic based on the first specificity score and the second specificity score, and may provide information that identifies the trend.
    Type: Grant
    Filed: May 2, 2016
    Date of Patent: December 18, 2018
    Assignee: Accenture Global Solutions Limited
    Inventors: Janardan Misra, Shubhashis Sengupta, Sanjay Podder, Robert E. Kress, Ashwin Menon
  • Patent number: 10157220
    Abstract: A processor expands a search expression. The processor determines nodes representing query terms of a search expression. The nodes have associated text for search term expansion, and represent at least one concept in a semantic graph of nodes that represents a domain of semantically related concepts. The processor determines i) a center of focus within the semantic graph for the two or more nodes based, at least in part, on a spreading activation in the graph and ii) a contextual relevance for the two or more nodes with respect to the center of focus. The processor selects, for a query term, a node based on contextual relevance between that node and the query term and expands the search expression using an associated text of that node.
    Type: Grant
    Filed: July 23, 2015
    Date of Patent: December 18, 2018
    Assignee: International Business Machines Corporation
    Inventors: Seamus R. Mac an tSaoir, Daniel J. McCloskey, Ahmed M. M. R. Salem, Mikhail Sogrin
  • Patent number: 10152971
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for advanced turn-taking in an interactive spoken dialog system. A system configured according to this disclosure can incrementally process speech prior to completion of the speech utterance, and can communicate partial speech recognition results upon finding particular conditions. A first condition which, if found, allows the system to communicate partial speech recognition results, is that the most recent word found in the partial results is statistically likely to be the termination of the utterance, also known as a terminal node. A second condition is the determination that all search paths within a speech lattice converge to a common node, also known as a pinch node, before branching out again. Upon finding either condition, the system can communicate the partial speech recognition results. Stability and correctness probabilities can also determine which partial results are communicated.
    Type: Grant
    Filed: June 23, 2016
    Date of Patent: December 11, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Jason D. Williams, Ethan Selfridge
  • Patent number: 10147427
    Abstract: A method for electronically utilizing content in a communication between a customer and a customer representative is provided. An audible conversation between a customer and a service representative is captured. At least a portion of the audible conversation is converted into computer searchable data. The computer searchable data is analyzed during the audible conversation to identify relevant meta tags previously stored in a data repository or generated during the audible conversation. Each meta tag is associated with the customer. Each meta tag provides a contextual item determined from at least a portion of one of a current or previous conversation with the customer. A meta tag determined to be relevant to the current conversation between the service representative and the customer is displayed in real time to the service representative currently conversing with the customer.
    Type: Grant
    Filed: September 17, 2015
    Date of Patent: December 4, 2018
    Assignee: United Services Automobile Association
    Inventors: Zakery L. Johnson, Jonathan E. Neuse
  • Patent number: 10146756
    Abstract: Various techniques for providing perspective annotation to numerical representations are disclosed herein. For example, a method includes detecting a numerical representation in an original content and retrieving one or more perspectives from a database based on the detected numerical representation. The one or more perspectives individually include a restatement of information contained in the numerical representation. The method can also include annotating the original content with the retrieved one or more perspectives to form an annotated content.
    Type: Grant
    Filed: March 13, 2013
    Date of Patent: December 4, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jake Hofman, Miroslav Dudik, Daniel Goldstein
  • Patent number: 10140983
    Abstract: A method, a system, and a computer program product for building an n-gram language model for automatic speech recognition. The method includes reading training text data and additional text data both for the n-gram language model from a storage, and building the n-gram language model by a smoothing algorithm having discount parameters for n-gram counts. The additional text data includes plural sentences having at least one target keyword. Each discount parameter for each target keyword is tuned using development data which are different from the additional text data so that a predetermined balance between precision and recall is achieved.
    Type: Grant
    Filed: August 28, 2015
    Date of Patent: November 27, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gakuto Kurata, Toru Nagano, Masayuki Suzuki, Ryuki Tachibana
  • Patent number: 10140272
    Abstract: Mechanisms are provided for implementing a context aware abbreviation detection and annotation operation. An instance of a full name of an entity is identified in received content and analysis of a context window associated with the instance of the full name of the entity is performed to identify a presence of a pattern of content representative of an abbreviation. An abbreviation is identified as being present in association with the instance of the full name of the entity based on results of the analysis of the context window and a mapping data structure that maps the full name of the entity to the abbreviation is generated. The received content is annotated based on the mapping data structure to thereby generate abbreviation annotations for the received content. The annotated received content is output for use by a cognitive system to perform a cognitive operation based on the annotated received content.
    Type: Grant
    Filed: September 25, 2015
    Date of Patent: November 27, 2018
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Neil S. Bhargava, Richard L. Martin
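    Illustration: a minimal sketch of the context-window idea in the abstract above. The abstract does not specify which patterns count as "representative of an abbreviation", so this example assumes one common pattern, a parenthesized short form directly after the full name. The function name `find_abbreviation`, the `window` size, and the regular expression are all hypothetical.

```python
import re

def find_abbreviation(text, full_name, window=40):
    """Map full_name to an abbreviation found in the window after one of its instances."""
    mapping = {}
    for match in re.finditer(re.escape(full_name), text):
        # Context window: the characters immediately following the full name.
        context = text[match.end():match.end() + window]
        # Assumed pattern: "(ABBR)" starting with a capital letter, 2 to 10 characters.
        hit = re.match(r"\s*\(([A-Z][A-Za-z0-9]{1,9})\)", context)
        if hit:
            mapping[full_name] = hit.group(1)
            break
    return mapping
```

    The returned dictionary plays the role of the mapping data structure; annotating later occurrences of the abbreviation would be a separate pass over the content.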