Patents by Inventor Dai Kusui

Dai Kusui has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM

Publication number: 20120303611

Abstract: The information processing device 1 processes document collections having tags permitting semantic class identification appended to each document and comprises a search unit 2, which creates multiple semantic class units containing one, two, or more semantic classes based on a taxonomy that identifies relationships between semantic classes, and a frequency calculation unit 3 which, for each of the semantic class units, identifies documents that match that semantic class unit in the document collections and, for these matching documents, calculates a first frequency that represents the frequency of occurrence in a designated document collection and a second frequency that represents the frequency of occurrence in non-designated document collections. Once the calculations have been performed, the search unit 2 identifies any of the semantic class units based on the first frequency and the second frequency of the matching documents.

Type: Application

Filed: December 21, 2010

Publication date: November 29, 2012

Applicant: NEC CORPORATION

Inventors: Yukitaka Kusumura, Hironori Mizuguchi, Dai Kusui
DOCUMENT CLUSTERING SYSTEM, DOCUMENT CLUSTERING METHOD, AND RECORDING MEDIUM

Publication number: 20120259855

Abstract: In the provided document clustering system (100), a concept tree structure accumulation unit (11) stores a concept tree structure that represents a hierarchical relationship among concepts represented by each of a plurality of words. For any two words, a concept similarity computation unit (12) obtains a concept similarity, which is an index indicating how close the concepts represented by the two words are. Using concept similarities for words that appear in two documents in a document set, an inter-document similarity computation unit (13) obtains an inter-document similarity, which indicates how similar the two documents are semantically. A clustering unit (14) uses inter-document similarities to cluster the documents in the document set.

Type: Application

Filed: December 21, 2010

Publication date: October 11, 2012

Applicant: NEC CORPORATION

Inventors: Hironori Mizuguchi, Dai Kusui
RELATED DOCUMENT SEARCH SYSTEM, DEVICE, METHOD AND PROGRAM

Publication number: 20120239654

Abstract: Provided is a related document search system which can provide supplementary information showing a related content together with a related document related to a predetermined document.

Type: Application

Filed: November 26, 2010

Publication date: September 20, 2012

Applicant: NEC CORPORATION

Inventors: Kenji Tateishi, Itaru Hosomi, Dai Kusui
System and method for judging properties of an ontology and updating same

Patent number: 8244769

Abstract: To provide a technique for structuralizing ontology in a prescribed form to a structure to which features of data are reflected. An ontology processing device has a structuralizing device for structuralizing properties of the ontology in the prescribed form generated from a set of instance data containing a combination of a subject, a property, and an object expressed with a character string according to the features of the object, and has a ontology storage device which stores the ontology structuralized by the structuralizing device. With this structure, the properties of the ontology in the prescribed form are corrected or expressed as an ontology structure by reflecting the characteristics of a set of the objects obtained from the data.

Type: Grant

Filed: May 27, 2008

Date of Patent: August 14, 2012

Assignee: NEC Corporation

Inventors: Itaru Hosomi, Hironori Mizuguchi, Dai Kusui
INFORMATION CLASSIFICATION DEVICE, INFORMATION CLASSIFICATION METHOD, AND COMPUTER READABLE RECORDING MEDIUM

Publication number: 20120143801

Abstract: An information classification device (1) is provided with an union of sets determination unit (10) which performs correct/incorrect determination regarding a content to be classified using a union of sets rule, and an individual determination unit (11) which applies a plurality of individual determination rules to the content to be classified which has been determined as correct, determines whether the content matches the condition, and performs correct/incorrect determination again regarding the content to be classified which has been determined as correct on the basis of the determination result of each individual determination rule. The union of sets determination rule is created using a result of correct/incorrect determination previously performed by two or more people with respect to a plurality of contents which are different from the contents to be classified, and also using feature amounts of respective different contents.

Type: Application

Filed: June 1, 2010

Publication date: June 7, 2012

Applicant: NEC CORPORATION

Inventors: Masaaki Tsuchida, Hironori Mizuguchi, Dai Kusui
CLASSIFICATION HIERARCHY REGENERATION SYSTEM, CLASSIFICATION HIERARCHY REGENERATION METHOD, AND CLASSIFICATION HIERARCHY REGENERATION PROGRAM

Publication number: 20120109963

Abstract: A classification hierarchy regeneration system is provided, wherein when a new classification hierarchy is generated by restructuring an existing classification hierarchy, a classification hierarchy in view of hierarchical relationship of classifications and a classification hierarchy integrating classifications of the same meaning can be efficiently generated. The clustering means clusters a data group associated with a hierarchical classification, and generating a classification group, i.e., a group obtained by extracting a classification satisfying a condition defined in advance from classifications corresponding to respective data in a cluster. The cooccurrence degree calculation means calculates a degree of cooccurrence of two classifications selected from the classification group. The classification hierarchy regeneration means regenerates the hierarchy of classification based on the classification group and the degree of cooccurrence.

Type: Application

Filed: April 20, 2010

Publication date: May 3, 2012

Applicant: NEC CORPORATION

Inventors: Hironori Mizuguchi, Dai Kusui
TRAINING DATA GENERATION APPARATUS, CHARACTERISTIC EXPRESSION EXTRACTION SYSTEM, TRAINING DATA GENERATION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM

Publication number: 20120030157

Abstract: The disclosed apparatus uses a training data generation apparatus 2, which generates training data used for creating characteristic expression extraction rules. The training data generation apparatus 2 includes: a training data candidate clustering unit 21, which clusters a plurality of training data candidates assigned labels indicating annotation classes based on feature values containing respective context information, and a training data generation unit 22 which, by referring to each cluster obtained using the clustering results, obtains the distribution of the labels of the training data candidates within the cluster, identifies training data candidates that meet a preset condition based on the obtained distribution, and generates training data using the identified training data candidates.

Type: Application

Filed: March 17, 2010

Publication date: February 2, 2012

Applicant: NEC CORPORATION

Inventors: Masaaki Tsuchida, Hironori Mizuguchi, Dai Kusui
DATA CLASSIFIER SYSTEM, DATA CLASSIFIER METHOD AND DATA CLASSIFIER PROGRAM

Publication number: 20110179037

Abstract: A data classifier system of the present invention selects a plurality of classifications correlated to data groups so as to output classification axes based on hierarchical classifications and data groups. The data classifier system includes a basic category accumulation means, a classification axis candidate creation means and a priority calculation means. The basic category accumulation means accumulates classifications serving as basic categories used for selecting desired classifications in advance. The classification axis candidate creation means creates classification axis candidates based on combinations of classifications each correlated to at least one data among descendant classifications of each basic category. The priority calculation means calculates priorities with respect to the classification axis candidates created by the classification axis candidate creation means based on hierarchical distances of classifications in the classified hierarchy.

Type: Application

Filed: July 29, 2009

Publication date: July 21, 2011

Inventors: Hironori Mizuguchi, Kenji Tateishi, Itaru Hosomi, Dai Kusui
INFORMATION EXTRACTION SYSTEM, INFORMATION EXTRACTION METHOD, INFORMATION EXTRACTION PROGRAM, AND INFORMATION SERVICE SYSTEM

Publication number: 20110161144

Abstract: According to the present invention, phrases of the same kind can be extracted from a plurality of documents having various formats. A storage device stores a plurality of documents that have various formats. A pattern candidate creating unit receives a list of input words that are selected as samples among phrases that are to be included in a dictionary. The pattern candidate creating unit selects one document, determines forward and backward character strings of input words in the selected document as candidates of patterns, and stores the forward and backward character strings as a pattern candidate. The pattern candidate creating unit executes the above processes for each of the documents. A phrase candidate creating unit extracts phrases interposed between patterns included in the pattern candidate as candidates of phrases to be output, and stores the extracted phrases as a phrase candidate.

Type: Application

Filed: March 23, 2007

Publication date: June 30, 2011

Applicant: NEC CORPORATION

Inventors: Hironori Mizuguchi, Masaaki Tsuchida, Dai Kusui, Hideki Kawai
DATA CLASSIFIER SYSTEM, DATA CLASSIFIER METHOD AND DATA CLASSIFIER PROGRAM

Publication number: 20110153615

Abstract: A data classifier system of the present invention selects a plurality of classifications correlated to data groups so as to output classification axes based on hierarchical classifications and data groups. The data classifier system includes a basic category accumulation means, a classification axis candidate reduction means and a priority calculation means. The basic category accumulation means accumulates classifications serving as basic categories used for desired classifications in advance. The classification axis candidate reduction means selects a plurality of classifications from among classifications descendant from each basic category so as to create classification axis candidates, thus reducing classification axis candidates subjected to calculations based on data quantity of classifications and hierarchical distances of classifications.

Type: Application

Filed: July 29, 2009

Publication date: June 23, 2011

Inventors: Hironori Mizuguchi, Kenji Tateishi, Itaru Hosomi, Dai Kusui
COOCCURRENCE DICTIONARY CREATING SYSTEM, SCORING SYSTEM, COOCCURRENCE DICTIONARY CREATING METHOD, SCORING METHOD, AND PROGRAM THEREOF

Publication number: 20110055228

Abstract: A cooccurrence dictionary creating system includes: a language analyzing section which subjects a text to a morpheme analysis, a clause specification, and a modification relationship analysis between clauses, a cooccurrence relationship collecting section which collects cooccurrences of nouns in each clause of the text, modification relationships of nouns and declinable words, and modification relationships between declinable words as cooccurrence relationships, a cooccurrence score calculating section which calculates a cooccurrence score of the cooccurrence relationship based on a frequency of the collected cooccurrence relationship, and a cooccurrence dictionary storage section which stores a cooccurrence dictionary in which a correspondence between the calculated cooccurrence score and the cooccurrence relationship is described.

Type: Application

Filed: April 1, 2009

Publication date: March 3, 2011

Inventors: Masaaki Tsuchida, Hironori Mizuguchi, Dai Kusui
WORD CLASSIFICATION SYSTEM, METHOD, AND PROGRAM

Publication number: 20110029303

Abstract: A word classification system is provided with an inter-word pattern learning section for learning at least either the context information or the layout information between classification-known words which co-appear and creating an inter-word pattern for determining whether data relating to a word pair which is a combination of words is data relating to a same-classification word pair which is the combination of words in the same classification or data relating to a different-classification word pair which is a combination of words in different classifications on the basis of the relationship between the classification-known words which co-appear in a document.

Type: Application

Filed: April 2, 2009

Publication date: February 3, 2011

Inventors: Hironori Mizuguchi, Masaaki Tsuchida, Dai Kusui
ATTRIBUTE EXTRACTION METHOD, SYSTEM, AND PROGRAM

Publication number: 20100318525

Abstract: Sets of strings of which the drawing positions are arranged in one direction are extracted from a document as attribute groups. An attribute name score is calculated for each attribute group to determine an extent to which each attribute group is a set of attribute names. Based on the attribute name scores, an attribute name group is selected out of the attribute groups. From among the attribute groups, an attribute group which includes a string which is the same as at least one string of the attribute name group and of which the drawing position is the same as that of the string of the attribute name group is selected. From the string at the same drawing position, an attribute name is extracted. From the other strings of the selected attribute group than those at the same drawing position, an attribute value corresponding to the attribute name is extracted.

Type: Application

Filed: March 5, 2009

Publication date: December 16, 2010

Inventors: Hironori Mizuguchi, Masaaki Tsuchida, Dai Kusui
Data clustering system, data clustering method, and data clustering program

Patent number: 7827179

Abstract: An object of the present invention is to perform data clustering while preventing the processing speed from decreasing while maintaining accuracy. A block division section 3 divides a block received from a DB access section 2 into sufficiently small blocks. A block storage section 8 stores blocks supplied from the block division section 3 and hierarchical relationship between the blocks. A block integration section 5 integrates blocks and groups in the order from a hierarchically deeper position to a shallower position based on the stored hierarchical relationship.

Type: Grant

Filed: September 1, 2006

Date of Patent: November 2, 2010

Assignee: NEC Corporation

Inventors: Dai Kusui, Kenji Tateishi, Haruka Saito
ONTOLOGY PROCESSING DEVICE, ONTOLOGY PROCESSING METHOD, AND ONTOLOGY PROCESSING PROGRAM

Publication number: 20100121885

Abstract: To provide a technique for structuralizing ontology in a prescribed form to a structure to which features of data are reflected. An ontology processing device has a structuralizing device for structuralizing properties of the ontology in the prescribed form generated from a set of instance data containing a combination of a subject, a property, and an object expressed with a character string according to the features of the object, and has a ontology storage device which stores the ontology structuralized by the structuralizing device. With this structure, the properties of the ontology in the prescribed form are corrected or expressed as an ontology structure by reflecting the characteristics of a set of the objects obtained from the data.

Type: Application

Filed: May 27, 2008

Publication date: May 13, 2010

Applicant: NEC CORPORATION

Inventors: Itaru Hosomi, Hironori Mizuguchi, Dai Kusui
FIELD CORRELATION METHOD AND SYSTEM, AND PROGRAM THEREOF

Publication number: 20100100804

Abstract: A field pair as a combination of a definite field and an indefinite field is decided and a correlation value between the definite field and the indefinite field in each of the field pairs is calculated. Among the field pairs in which the correlation value is not smaller than a threshold value, indefinite fields having corresponding definite fields which belong to the same field group are made to be a new field group.

Type: Application

Filed: March 4, 2008

Publication date: April 22, 2010

Inventors: Kenji Tateishi, Dai Kusui
Search method, similarity calculation method, similarity calculation, same document matching system, and program thereof

Publication number: 20100023505

Abstract: Same document group creation means (11) acquires a ratio of common words and characters between documents in order to obtain a predetermined similarity greater than a predetermined threshold value between the documents. According to the ratio, words or characters are selected with a common priority in all the documents to be matched. The documents are correlated to the same document candidate group identified by the selected words or characters and stored in a same group candidate group storage unit (22).

Type: Application

Filed: September 13, 2007

Publication date: January 28, 2010

Inventors: Kenji Tateishi, Dai Kusui
POLARITY ESTIMATION SYSTEM, INFORMATION DELIVERY SYSTEM, POLARITY ESTIMATION METHOD, POLARITY ESTIMATION PROGRAM AND EVALUATION POLARITY ESTIMATIOM PROGRAM

Publication number: 20100017391

Abstract: An evaluation polarity of reputation information with an unknown evaluation polarity is estimated by utilizing reputation information with a known evaluation polarity. The present polarity estimation system is a polarity estimation system for estimating an evaluation polarity indicating whether reputation information is positive or negative, and includes a reputation information storage part that precedently stores reputation information with a known evaluation polarity; and a polarity estimating means for estimating an evaluation polarity of reputation information with an unknown evaluation polarity on the basis of the reputation information with the known evaluation polarity precedently stored in the reputation information storage part.

Type: Application

Filed: November 20, 2007

Publication date: January 21, 2010

Inventors: Hironori Mizuguchi, Dai Kusui, Masaaki Tsuchida
DATA CLUSTERING SYSTEM, DATA CLUSTERING METHOD, AND DATA CLUSTERING PROGRAM

Publication number: 20090271420

Abstract: An object of the present invention is to perform data clustering while preventing the processing speed from decreasing while maintaining accuracy. A block division section 3 divides a block received from a DB access section 2 into sufficiently small blocks. A block storage section 8 stores blocks supplied from the block division section 3 and hierarchical relationship between the blocks. A block integration section 5 integrates blocks and groups in the order from a hierarchically deeper position to a shallower position based on the stored hierarchical relationship.

Type: Application

Filed: September 1, 2006

Publication date: October 29, 2009

Applicant: NEC CORPORATION

Inventors: Dai Kusui, Kenji Tateishi, Haruka Saito
Human image dialogue device and a recording medium storing a human image dialogue device

Patent number: 6434525

Abstract: A device is provided that generates the gestures and expressions of a human image on a computer without expending a great amount of labor. The words for the system response to the input of a user and the state of the dialogue are described in a dialogue flow memory unit, a dialogue flow analysis unit analyzes the spoken text of the flow, extracts the key words associated with a movement pattern by referring to a text movement association table, and the movement expression generation unit generates the movements corresponding to the movement pattern. In the generation of the movement, movement patterns determined in advance are selected according to the state of the dialogue written in the dialogue flow, and the movement pattern is determined or modified by the key words.

Type: Grant

Filed: May 26, 1999

Date of Patent: August 13, 2002

Assignee: NEC Corporation

Inventors: Izumi Nagisa, Dai Kusui

prev 1 2 3