Patents by Inventor Shinichi Ando
Shinichi Ando has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20120304055
Abstract: A document analysis apparatus comprises: a feature expression acquisition unit acquiring a feature expression appearing during an attention period in an analysis object document collection; a document collection acquisition unit acquiring a feature expression containing document (FECD) collection in which a feature expression appears, from an analysis population including an analysis object document collection; a context determination unit specifying an analysis/FECD corresponding to an analysis object document among a FECD collection for every feature expression, and specifies a context in which the feature expression appeared in multiple analysis/FECDs; a context comparison determination unit specifying a non analysis/FECD not corresponding to an analysis object document among a FECD collection, and within that, compares a context in which the feature expression has appeared and a context specified previously; and a feature degree setting unit performing giving or the like of a feature degree to a feature e
Type: Application
Filed: January 25, 2011
Publication date: November 29, 2012
Applicant: NEC CORPORATION
Inventors: Satoshi Nakazawa, Shinichi Ando
-
Publication number: 20120284016
Abstract: Disclosed are a text mining method, device, and program capable of performing text mining with a specific topic as an object with high precision. An element identification unit calculates a feature degree, which is an index for indicating a degree that within a text set of interest, which is a set of text that is to be analyzed, an element of the text appears. An output unit identifies distinctive elements within the text set of interest on the basis of the calculated feature degree and outputs the identified elements. The element identification unit corrects the feature degree on the basis of a topic relatedness degree, which is a value indicating a degree related to a topic of analysis, which is a topic for which each text portion of the text being analyzed has been partitioned into predetermined units that are to be analyzed.
Type: Application
Filed: December 7, 2010
Publication date: November 8, 2012
Applicant: NEC CORPORATION
Inventors: Akihiro Tamura, Kai Ishikawa, Shinichi Ando
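The idea of correcting a feature degree by topic relatedness can be illustrated with a minimal sketch. This is not the method claimed in the application: the function name `feature_degrees`, the input layout, and the weighting rule (each occurrence counts by the relatedness of the text portion it appears in) are all assumptions chosen for illustration.

```python
from collections import defaultdict

def feature_degrees(portions):
    """Return a topic-weighted feature degree per element.

    `portions` is a list of (tokens, of_interest, relatedness) tuples
    (hypothetical layout): the tokens of one text portion, whether the
    portion belongs to the text set of interest, and a topic relatedness
    degree in [0, 1].
    Assumed rule: an occurrence counts `relatedness` instead of 1, and the
    feature degree is the weighted share of occurrences that fall inside
    the text set of interest.
    """
    in_interest = defaultdict(float)
    overall = defaultdict(float)
    for tokens, of_interest, relatedness in portions:
        for token in tokens:
            overall[token] += relatedness
            if of_interest:
                in_interest[token] += relatedness
    # Skip elements whose total weight is zero to avoid dividing by zero.
    return {t: in_interest[t] / overall[t] for t in overall if overall[t] > 0}

portions = [
    (["refund", "delay"], True, 1.0),   # on-topic portion in the set of interest
    (["refund", "hello"], False, 0.5),  # partly on-topic portion outside the set
    (["hello"], True, 0.0),             # off-topic portion, contributes nothing
]
degrees = feature_degrees(portions)
```

Under this assumed rule, an element that appears in the set of interest only inside off-topic portions receives no credit, which is the effect the abstract attributes to the correction.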
-
Patent number: 8306933
Abstract: Disclosed is an information providing system comprising a receiving unit that receives an information request from a requester, a data storage unit that stores data, a detection processing unit that analyzes the content of the information request and extracts provision candidate data corresponding to the information request from the data storage unit, a responder output device to which the content of the information request and the provision candidate data are output, a responder input device that receives instruction information on whether or not the provision candidate data is to be provided, a response control unit that determines whether or not there is providable data based on the received instruction information and the provision candidate data, and an answer generating unit that generates answer data using the decision result by the response control unit.
Type: Grant
Filed: May 21, 2007
Date of Patent: November 6, 2012
Assignee: NEC Corporation
Inventors: Takao Kawai, Shinichi Doi, Shinichi Ando, Kunihiko Sadamasa, Yoshiko Matsukawa
-
Publication number: 20120278327
Abstract: A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). The common assessment information selection unit (90) identifies information that matches second assessment information that appears in event-related documents which include descriptions concerning a designated specific event, from among first assessment information that appears in documents for analysis which include descriptions relating to items for analysis, and classifies the information thus identified as common assessment information.
Type: Application
Filed: November 8, 2010
Publication date: November 1, 2012
Applicant: NEC CORPORATION
Inventors: Satoshi Nakazawa, Shinichi Ando, Yoshio Ishizawa, Yuzuru Okajima
-
Patent number: 8301435
Abstract: A language processing device includes first analysis unit 21 that subjects a natural language sentence containing a polysemic word and other words to a predetermined analysis and outputs a plurality of analysis results for the natural language sentence according to a plurality of meanings of the polysemic word, second analysis unit 23 that performs a particular analysis on the analysis results outputted from first analysis unit 21, and employs one of the analysis results, and generation unit 244 that generates a deletion rule for deleting one or more unnecessary analysis results of the first analysis unit 21 which has been deleted from the analysis results outputted from first analysis unit 21 but employed by second analysis unit 23, according to the analysis results outputted from the first analysis unit 21 and the employment result of second analysis unit 23.
Type: Grant
Filed: February 9, 2007
Date of Patent: October 30, 2012
Assignee: NEC Corporation
Inventors: Kunihiko Sadamasa, Shinichi Ando, Shinichi Doi
-
Publication number: 20120259805
Abstract: Disclosed is an information estimation device for estimating an appropriate issue time from a time representation described in a document without intervention of any operator; wherein an information estimation device (1) which is a device for estimating an issue time of a document to be estimated, includes a candidate generation unit (11) which extracts a time representation described in the document, and on the basis of the extracted time representation, generates a plurality of possible issue time candidates having possibilities corresponding to the issue time of the document; and an issue time estimation unit (12) for obtaining a temporal proximity, for each of the plurality of issue time candidates, between the issue time candidate and other issue time candidates, and on the basis of the obtained temporal proximity, estimating the issue time of the document.
Type: Application
Filed: December 9, 2010
Publication date: October 11, 2012
Applicant: NEC CORPORATION
Inventors: Takao Kawai, Shinichi Ando, Satoshi Nakazawa
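A rough sketch of selecting an issue time by temporal proximity follows. It assumes the candidates have already been generated as calendar dates; the scoring function `proximity` (inverse of one plus the day gap) and the rule of picking the candidate closest to all the others are illustrative assumptions, not details taken from the application.

```python
from datetime import date

def proximity(a, b):
    """Assumed proximity measure: 1 for the same day, decaying with the gap in days."""
    return 1.0 / (1.0 + abs((a - b).days))

def estimate_issue_time(candidates):
    """Return the candidate with the highest total proximity to the other candidates."""
    if not candidates:
        raise ValueError("at least one issue time candidate is required")

    def score(i):
        return sum(
            proximity(candidates[i], other)
            for j, other in enumerate(candidates)
            if j != i
        )

    best_index = max(range(len(candidates)), key=score)
    return candidates[best_index]

# Three candidates cluster in early December 2010; one outlier sits in 2009.
candidates = [
    date(2010, 12, 1),
    date(2010, 12, 3),
    date(2010, 12, 2),
    date(2009, 1, 15),
]
estimated = estimate_issue_time(candidates)
```

With these inputs the middle date of the December cluster scores highest, because the outlier contributes almost nothing to any candidate's total.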
-
Publication number: 20120254071
Abstract: Disclosed are a text mining system, text mining method, and recording medium for suppressing increase in cost of analysis for an analyst even if, when analyzing a plurality of data to be analyzed, the data are to be integrally analyzed.
Type: Application
Filed: December 15, 2010
Publication date: October 4, 2012
Applicant: NEC CORPORATION
Inventors: Kai Ishikawa, Shinichi Ando, Akihiro Tamura
-
Publication number: 20120239665
Abstract: Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.
Type: Application
Filed: November 15, 2010
Publication date: September 20, 2012
Applicant: NEC CORPORATION
Inventors: Yuzuru Okajima, Shinichi Ando, Satoshi Nakazawa
-
Publication number: 20120117068
Abstract: The text mining device 300 includes a clustering section 301. The clustering section 301 performs clustering on a plurality of characteristic expressions extracted from a document set such that characteristic expressions, in which sentences to be referred to as original sentences are the same, are compiled in one cluster, based on the similarity in original document sets which are sets of documents including the respective characteristic expressions, the documents being of the document set. Consequently, the probability of repeatedly viewing the same original document by a user can be reduced reliably.
Type: Application
Filed: April 8, 2010
Publication date: May 10, 2012
Applicant: NEC CORPORATION
Inventors: Takashi Onishi, Shinichi Ando, Satoshi Nakazawa
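Grouping characteristic expressions by how much their original document sets overlap might look like the sketch below. The Jaccard measure, the 0.5 threshold, and the single greedy pass are assumptions made for illustration; the abstract does not say which similarity measure or clustering procedure the application uses.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets of document IDs (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cluster_expressions(doc_sets, threshold=0.5):
    """Greedily group expressions whose original document sets overlap.

    `doc_sets` maps each characteristic expression to the set of IDs of the
    documents that contain it (hypothetical layout). An expression joins the
    first cluster holding a member whose document set is similar enough;
    otherwise it starts a new cluster.
    """
    clusters = []
    for expression, docs in doc_sets.items():
        for cluster in clusters:
            if any(jaccard(docs, doc_sets[member]) >= threshold for member in cluster):
                cluster.append(expression)
                break
        else:
            clusters.append([expression])
    return clusters

doc_sets = {
    "battery drains": {1, 2, 3},
    "short battery life": {1, 2, 3, 4},
    "screen cracked": {7, 8},
}
clusters = cluster_expressions(doc_sets)
```

Because the first two expressions come from nearly the same documents, they land in one cluster, so a user reviewing one cluster at a time would open those documents once rather than twice.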
-
Publication number: 20120096029
Abstract: An information analysis device (30) comprises a relevant portion identification unit (31) that compares analyzed target text with topic-related text that is written about the same event as the analyzed target text and includes information related to a specific topic, and that specifies a portion of the analyzed target text related to the topic-related text; a potential topic word extraction unit (32) that extracts a word of the specific portion; and a statistical model generation unit (33) that generates a statistical model that estimates a degree of appearance of a word on a specific topic of the analyzed target text. The statistical model generation unit (33) generates a statistical model such that degrees of appearance in a specific topic of the topic-related text word and of the extracted word are higher than those of other words.
Type: Application
Filed: May 28, 2010
Publication date: April 19, 2012
Applicant: NEC CORPORATION
Inventors: Akihiro Tamura, Kai Ishikawa, Shinichi Ando
-
Patent number: 8140530
Abstract: [PROBLEMS] To accurately calculate similarity between media data and a query even if the media data or its meta data has an error. [MEANS FOR SOLVING THE PROBLEMS] A similarity calculation device includes: a single score calculation device used when calculating similarity between first media data and a query, which calculates a single score that shows similarity between second media data different from the first media data and the query; an inter-media similarity calculation device which calculates inter-media similarity that shows the similarity between the second media data and the first media data; and a query similarity calculation device which obtains similarity between the first media data and the query by using the inter-media similarity of the second media data and the single score.
Type: Grant
Filed: August 2, 2007
Date of Patent: March 20, 2012
Assignee: NEC Corporation
Inventors: Makoto Terao, Takafumi Koshinaka, Shinichi Ando, Yoshifumi Onishi
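One way to picture combining single scores with inter-media similarity is a weighted average, sketched below. The combination rule and the name `query_similarity` are assumptions for illustration only; the abstract states that both quantities are used but does not specify how they are combined.

```python
def query_similarity(single_scores, inter_media_sims):
    """Estimate how well the first media item matches a query.

    `single_scores` maps each second media item to its own similarity to the
    query; `inter_media_sims` maps each second media item to its similarity
    to the first media item. Assumed rule: average the single scores,
    weighting each by how similar that item is to the first media item.
    """
    total_weight = sum(inter_media_sims.values())
    if total_weight == 0:
        return 0.0
    weighted = sum(
        weight * single_scores.get(item, 0.0)
        for item, weight in inter_media_sims.items()
    )
    return weighted / total_weight

# Item "b" is close to the first media item and matches the query well;
# item "c" is less similar to it and matches the query poorly.
single_scores = {"b": 0.9, "c": 0.1}
inter_media_sims = {"b": 0.75, "c": 0.25}
similarity = query_similarity(single_scores, inter_media_sims)
```

The point of the indirection is that the first media item's own metadata is never compared with the query, so an error in that metadata does not directly distort the result.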
-
Publication number: 20120016664
Abstract: A language analysis apparatus of the invention includes division rules, each of which is classified into one of levels according to the degree of risk of causing analysis accuracy problems when applied; a division point candidate generation unit 21 which, when a character string whose length is greater than the predetermined maximum input length is input, generates division point candidates for the input character string by applying the division rules sequentially one by one in the ascending order of the level of risk of causing problems; a division point adjustment unit 22 which, when the length of a division unit candidate obtained by the division point candidate generated by the division point candidate generation unit 21 is less than the maximum input length, selects a combination of division points from among the division point candidates obtained by applying division rules of the same level while ensuring that each division unit is not greater in length than the maximum input length; and a division unit
Type: Application
Filed: March 23, 2010
Publication date: January 19, 2012
Applicant: NEC CORPORATION
Inventors: Shinichi Ando, Kunihiko Sadamasa
-
Publication number: 20110320452
Abstract: An information estimation apparatus 1 for estimating a transmission point in time of a document whose transmission point in time is not specified in a document set to be analyzed includes a structure analysis unit 3 configured to specify, from the document set, a document having a document structure in which a link relationship with another document is indicated in a table-of-contents manner, and extract the link relationship of documents included in the document set from the document structure of the specified document, a grouping unit 4 configured to set a group of documents using the specified document and the extracted link relationship, and an estimation unit 5 configured to estimate, based on the set group and a transmission point in time of a document that is included in the group and whose transmission point in time is specified, a transmission point in time of a document that is included in the group and whose transmission point in time is not specified.
Type: Application
Filed: December 21, 2009
Publication date: December 29, 2011
Applicant: NEC CORPORATION
Inventors: Takao Kawai, Satoshi Nakazawa, Shinichi Ando
-
Publication number: 20110282653
Abstract: A text processing apparatus is provided with a segment determination unit 36 and a descriptive content determination unit 33. The segment determination unit 36 determines, with respect to a homogeneous segment that is similar to segments constituting a first text which is set as an analysis target (analysis target text) and that is included in another first text, whether the content thereof is included in a second text. The descriptive content determination unit 33 determines whether each segment constituting the analysis target text should be described in a corresponding second text, based on the determination result.
Type: Application
Filed: December 21, 2009
Publication date: November 17, 2011
Inventors: Akihiro Tamura, Kai Ishikawa, Shinichi Ando
-
Publication number: 20110202545
Abstract: The information extraction device for extracting specific information using information extraction rules comprises a case candidate extraction means for extracting new specific information that is not extracted by the information extraction rules as novel case candidates based on extraction results obtained from extraction target text data; a rule candidate generation means for generating multiple extraction rule candidates based on the novel case candidates; a relation analysis means for analyzing the derivational relation between the novel case candidates and the extraction rule candidates and the overlapping relation between the multiple extraction rule candidates to generate relation analysis results; and a case candidate selection means for calculating the priorities of the novel case candidates based on the relation analysis results and previously prepared case information and selecting the novel case candidates according to the priority.
Type: Application
Filed: January 6, 2009
Publication date: August 18, 2011
Inventors: Takao Kawai, Shinichi Ando
-
Publication number: 20110161368
Abstract: A text mining apparatus, a text mining method, and a program are provided that accurately discriminate inherent portions of each of a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used performs text mining using, as targets, a plurality of text data pieces including a text data piece generated by computer processing. Confidence is set for each of the text data pieces. The text mining apparatus 1 includes an inherent portion extraction unit 6 that extracts an inherent portion of each text data piece relative to another of the text data pieces, using the confidence set for each of the text data pieces.
Type: Application
Filed: August 28, 2009
Publication date: June 30, 2011
Inventors: Kai Ishikawa, Akihiro Tamura, Shinichi Ando
-
Publication number: 20110161367
Abstract: A text mining apparatus, a text mining method, and a program are provided that enable the influence that computer processing errors have on mining results to be reduced during text mining performed on a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used includes an inherent portion extraction unit 6 that, for each of a plurality of text data pieces including a text data piece generated by computer processing, extracts an inherent portion of the text data piece relative to another of the text data pieces, an inherent confidence setting unit 7 that, for each inherent portion of each of the text data pieces, sets inherent confidence indicating confidence of the inherent portion, using the confidence that has been set for each of the text data pieces, and a mining processing unit 8 that performs text mining on each inherent portion of each of the text data pieces, using the inherent confidence.
Type: Application
Filed: August 28, 2009
Publication date: June 30, 2011
Applicant: NEC CORPORATION
Inventors: Kai Ishikawa, Akihiro Tamura, Shinichi Ando
-
Publication number: 20110153601
Abstract: An information analysis apparatus 1 that executes information analysis on a document set including documents to which time information is attached, the apparatus includes: a corresponding section selection unit 30 that mutually compares a plurality of time-series data generated and selects two or more sections that change corresponding to each of two or more sections of another time-series data from each time-series data; a feature extraction unit 40 that extracts features from the documents belonging to the selected two or more sections; a comparison unit 50 that acquires, from extracted features, an inter-feature distance of the selected one section and another section, and mutually compares the inter-feature distances of each of the time-series data; and a correlation degree calculation unit 70 that calculates a degree of correlation between the document sets based on the comparison result.
Type: Application
Filed: September 18, 2009
Publication date: June 23, 2011
Inventors: Satoshi Nakazawa, Shinichi Ando, Takao Kawai, Yuzuru Okajima
-
Publication number: 20110137641
Abstract: An information analysis device (1) uses a plurality of linguistic expressions as an analysis target, includes a link information generating unit (3) and a correlation value calculation unit (4). The link information generating unit (3) extracts time information included in each of a plurality of electronic documents including at least any one of the plurality of linguistic expressions and a relationship between the electronic documents in the plurality of electronic documents from the plurality of electronic documents, detects a link between one linguistic expression and another linguistic expression in the plurality of linguistic expressions and an appearance time of the link based on the extracted time information and the relationship between the electronic documents, and generates link information specifying the extracted link and the appearance time of the link.
Type: Application
Filed: September 4, 2009
Publication date: June 9, 2011
Inventors: Takao Kawai, Satoshi Nakazawa, Shinichi Ando
-
Publication number: 20110106849
Abstract: A new case whose type is the same as that of a case about information desired to be extracted can be generated with high accuracy.
Type: Application
Filed: March 9, 2009
Publication date: May 5, 2011
Applicant: NEC Corporation
Inventors: Takao Kawai, Shinichi Ando