Patents by Inventor Daisuke Takuma

Daisuke Takuma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10353932
    Abstract: A method includes hierarchically identifying occurrences of some of the words in the set of sentences; creating a first index for each of some of the words based on the upper hierarchy of occurrences identified for each word; receiving input of a queried word; hierarchically identifying occurrences of the queried word in the set of sentences; creating a second index based on the upper hierarchy of occurrences identified for the queried word; comparing the first index and the second index to calculate an estimated value for the number of occurrences of a word in the neighborhood of the queried word; and calculating the actual value of the number of occurrences of a word in the neighborhood of the queried word based on an upper hierarchy and lower hierarchy of the occurrences on condition that the estimated value is equal to or greater than a predetermined number.
    Type: Grant
    Filed: August 22, 2016
    Date of Patent: July 16, 2019
    Assignee: International Business Machines Corporation
    Inventors: Daisuke Takuma, Hiroki Yanagisawa
  • Publication number: 20190179910
    Abstract: Methods and systems for searching for similar documents include comparing an input index of a requested document to one or more stored indices for respective stored documents to produce a similarity score for each of the stored documents. Each index indicates which of a plurality of queries matched a respective document. The stored documents are filtered to remove dissimilar documents based on a comparison of each respective similarity score to a threshold. A list of any stored documents that remain after said filtering is output.
    Type: Application
    Filed: December 13, 2017
    Publication date: June 13, 2019
    Inventor: Daisuke Takuma
  • Patent number: 10296579
    Abstract: Aspects of the present invention disclose a method, computer program product, and system for generating target text based on target data. The method includes one or more processors decomposing one or more portions of text into at least one corresponding keyword and at least one corresponding template. The method further includes learning a classification model associated with selecting a template based on a category of a keyword. The method further includes identifying a target keyword that is represented by target data. The method further includes selecting a target template that is used to represent the target data based on a category associated with the identified target keyword utilizing the classification model. The method further includes generating target text that represents the target data based on the selected text template based on the selected target template and the identified target keyword.
    Type: Grant
    Filed: November 2, 2016
    Date of Patent: May 21, 2019
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Publication number: 20190147040
    Abstract: Embodiments relate to a type of expression based on a particular theme. An aspect includes acquiring, by an electronic apparatus, from text data for learning, a subset of the text data associated with the particular theme and with particular time period information. Another aspect includes extracting text data containing negative information from the acquired subset of the text data. Another aspect includes extracting a word or phrase having a high correlation with the extracted text data or a word or phrase having a high appearance frequency in the extracted text data from the extracted text data. Yet another aspect includes determining that the extracted word or phrase is the type of expression based on the particular theme.
    Type: Application
    Filed: January 16, 2019
    Publication date: May 16, 2019
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 10289674
    Abstract: Aspects of the present invention disclose a method, computer program product, and system for generating target text based on target data. The method includes one or more processors decomposing one or more portions of text into at least one corresponding keyword and at least one corresponding template. The method further includes learning a classification model associated with selecting a template based on a category of a keyword. The method further includes identifying a target keyword that is represented by target data. The method further includes selecting a target template that is used to represent the target data based on a category associated with the identified target keyword utilizing the classification model. The method further includes generating target text that represents the target data based on the selected text template based on the selected target template and the identified target keyword.
    Type: Grant
    Filed: September 29, 2015
    Date of Patent: May 14, 2019
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 10275447
    Abstract: Embodiments relate to a type of expression based on a particular theme. An aspect includes acquiring, by an electronic apparatus, from text data for learning, a subset of the text data associated with the particular theme and with particular time period information. Another aspect includes extracting text data containing negative information from the acquired subset of the text data. Another aspect includes extracting a word or phrase having a high correlation with the extracted text data or a word or phrase having a high appearance frequency in the extracted text data from the extracted text data. Yet another aspect includes determining that the extracted word or phrase is the type of expression based on the particular theme.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: April 30, 2019
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Publication number: 20190122144
    Abstract: A method, a system, and a computer program product for predicting an outcome expected for a particular positional value is provided. In the method, an input set of data records each having a label and a positional value, and a target positional value are obtained. The label of each data record is one in a label set. A learning model that includes an output layer, an input layer corresponding to the label set and a network structure provided therebetween is read. In the learning model, the network structure has a plurality of functions trained so as to evaluate influence from each label in the label set depending on relationship between the target positional value and a representative positional value associated with the label in the label set. A target outcome is estimated for the target positional value from the input set using the learning model.
    Type: Application
    Filed: October 25, 2017
    Publication date: April 25, 2019
    Inventors: Yoshinori Kabeya, Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 10176274
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Grant
    Filed: March 2, 2018
    Date of Patent: January 8, 2019
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Publication number: 20180349401
    Abstract: A computer-implemented method for checking a search query used for retrieving information from a collection of records in an event sequence analysis system is disclosed. The method includes receiving a search query including a plurality of predicates and one or more constraints each associated with plural predicates, in which each predicate is specified to be positive or negative, and the plurality of the predicates forms a logical graph together with the one or more constraints. The method also includes generating from the graph a subgraph including one or more predicates specified to be positive among the plurality of the predicates. The method further includes determining whether the subgraph has plural components or not, and invalidating the search query in response to the subgraph being determined to have plural components.
    Type: Application
    Filed: May 31, 2017
    Publication date: December 6, 2018
    Inventors: Yohei Ikawa, Daisuke Takuma
  • Publication number: 20180196888
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Application
    Filed: March 2, 2018
    Publication date: July 12, 2018
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 10007740
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Grant
    Filed: September 26, 2017
    Date of Patent: June 26, 2018
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 10002450
    Abstract: A hardware device analyzes a document that includes a text-based visual representation. A correspondence information hardware storage device holds known representations of graphical images as text-based visual representations. The graphical images depict portraits of physical objects. The text-based visual representations are associated with information that each describe one of the physical objects. An identification hardware device identifies a text-based visual representation within a document. The identification hardware device matches the text-based visual representation within the document to one or more of the text-based visual representations stored in the correspondence information hardware storage device. An editing hardware device retrieves information from the correspondence information hardware storage device that is identified, by the identification hardware device, as describing a text-based visual representation component within the document.
    Type: Grant
    Filed: July 14, 2014
    Date of Patent: June 19, 2018
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 9946812
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Grant
    Filed: September 26, 2017
    Date of Patent: April 17, 2018
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 9916397
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Grant
    Filed: November 9, 2016
    Date of Patent: March 13, 2018
    Assignee: International Business Machines Corporation
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 9892091
    Abstract: A computer-implemented method for computing an intersection or an intersection cardinality of each pair of a set in a first list of a plurality of sets and a set in a second list of a plurality of sets, the method including calculating a first union of a predetermined number of sets in the first list, obtaining filtered sets of the second list by filtering out an element from the plurality of sets in the second list, the element being not included in the first union, and intersecting a set in the first list and a set in the filtered sets of the second list.
    Type: Grant
    Filed: May 31, 2017
    Date of Patent: February 13, 2018
    Assignee: International Business Machines Corporation
    Inventor: Daisuke Takuma
  • Publication number: 20180018405
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Application
    Filed: September 26, 2017
    Publication date: January 18, 2018
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Publication number: 20180018406
    Abstract: Embodiments relate to generating a retrieval condition for retrieving a target character string from texts by pattern matching. An aspect includes dividing a first text into words. Another aspect includes generating a converted character string by performing at least one of appending at least one character in at least either one of previous and subsequent positions of the target character string. Another aspect includes replacing at least one character of the target character string. Another aspect includes generating the retrieval condition for retrieval candidates in the words of the first text, the retrieval condition comprising determining that a retrieval candidate matches the target character string and does not match the converted character string based on a ratio of a part of the retrieval candidate which matches the converted character string and corresponds to the target character string is less than or equal to a reference frequency.
    Type: Application
    Filed: September 26, 2017
    Publication date: January 18, 2018
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Publication number: 20180004727
    Abstract: A keyword to be categorized is received. A category dictionary including categories having associated registered keywords, and a text corpus are received. Registered keywords are identified in the category dictionary having a degree of similarity to the keyword to be categorized that is equal to or greater than a predetermined value, and the categories associated with the identified registered keywords are extracted. Registered keywords are identified that are co-occurring in the text corpus with the keyword to be categorized, and the categories associated with the identified co-occurring registered keywords are extracted. A degree of importance is determined for each extracted category based on a function of the identified registered keywords in the category dictionary and/or a function of the identified co-occurring registered keywords.
    Type: Application
    Filed: September 15, 2017
    Publication date: January 4, 2018
    Inventors: Emiko Takeuchi, Daisuke Takuma, Hirobumi Toyoshima
  • Patent number: 9858267
    Abstract: An apparatus for calculating a correlation between annotations includes a first obtaining unit configured to provide an annotator with a first data group capable of being evaluated to determine whether or not to attach annotations thereto, and obtaining a plurality of first confidence levels indicating certainty of the annotations in the first data group, the annotator outputting confidence levels indicating certainty of annotations to be attached to data when the data is given; a second obtaining unit configured to provide the annotator with a second data group used to calculate a correlation between the plurality of annotations, and thereby obtaining a plurality of second confidence levels indicating the certainty of the annotations in the second data group; and a computing unit configured to compute an estimated value of the correlation between the plurality of annotations based on the plurality of first and second confidence levels.
    Type: Grant
    Filed: June 24, 2015
    Date of Patent: January 2, 2018
    Assignee: International Business Machines Corporation
    Inventors: Yuki Makino, Takuma Murakami, Daisuke Takuma
  • Patent number: 9858266
    Abstract: An apparatus for calculating a correlation between annotations includes a first obtaining unit configured to provide an annotator with a first data group capable of being evaluated to determine whether or not to attach annotations thereto, and obtaining a plurality of first confidence levels indicating certainty of the annotations in the first data group, the annotator outputting confidence levels indicating certainty of annotations to be attached to data when the data is given; a second obtaining unit configured to provide the annotator with a second data group used to calculate a correlation between the plurality of annotations, and thereby obtaining a plurality of second confidence levels indicating the certainty of the annotations in the second data group; and a computing unit configured to compute an estimated value of the correlation between the plurality of annotations based on the plurality of first and second confidence levels.
    Type: Grant
    Filed: March 16, 2015
    Date of Patent: January 2, 2018
    Assignee: International Business Machines Corporation
    Inventors: Yuki Makino, Takuma Murakami, Daisuke Takuma