Method and apparatus for converting an expression using key words

An expression converting method wherein for each sentence in a corpus, key words are selected from the sentence, a combination of key words that are in a co-occurrence relation is identified from among a predetermined number of combinations of key words among the selected key words, and the identified key word combination and an expression into which the sentence from which the key words are selected is converted are previously associated, and wherein predetermined key words are selected from an input sentence, the selected key words are combined, the key word combinations and the previously identified key word combination:of each sentence are compared, one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison are selected, and expressions into which the selected sentences are converted are output.

Skip to: Description  ·  Claims  · Patent History  ·  Patent History
Description

[0001] This application is a divisional of U.S. patent application Ser. No. 09/803,779, filed Mar. 12, 2001.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention relates to an expression converting method, an expression converting apparatus and a program that convert the expression of an input sentence and output an expression into which the expression of the input sentence is converted, for example, to an expression converting method, an expression converting apparatus and a program that perform language conversion such as translation or interpretation, sentence pattern conversion to convert written language into spoken language, or summary creation to summarize a complicated or redundant sentence and output a summary.

[0004] 2. Description of the Related Art

[0005] A conventional technology will be described with an interpretation software program as an example.

[0006] An interpretation software program comprises voice recognizing means and language translating means, and realizes interpretation by successively executing voice recognition for converting a voiced sentence input as an acoustic signal into an output sentence represented as a word text string, and language translation as expression conversion to translate an input sentence represented as a word text string into a sentence in another language.

[0007] The language translating means executing language translation in a manner as described above comprises: language analyzing means of analyzing the syntactic or semantic construction of the input sentence; language converting means of converting the input sentence into another language based on the result of the analysis; and output sentence generating means of generating a natural output sentence from the result of the translation.

[0008] In interpretation software programs and some translation software programs, to correctly translate ungrammatical casual expressions which are frequently used in spoken language, a technique is frequently used such that a language analysis according to grammatical rules is not performed but sentence examples similar to actually input sentences are learned and the similar sentence examples are searched to perform language analysis. An example of such conventional interpretation software programs will be described with reference to the example shown in FIG. 8.

[0009] Here, a case will be described in which a voiced sentence example in Japanese is interpreted into a voiced sentence example in English. The language to be interpreted will be referred to as the original language, and the language into which the original language is to be interpreted will be referred to as the target language.

[0010] Before interpretation is performed, a voiced sentence rule is extracted from the bilingual corpus 1 of FIG. 8. In the bilingual corpus 1, a plurality of bilingual voiced sentence examples each comprising a pair of a Japanese voiced sentence example and an English voiced sentence example equivalent to each other are written. In FIG. 9 (a), an example of the bilingual voiced sentence examples written in the bilingual corpus 1 is shown as a bilingual voiced sentence example 70.

[0011] Here, considering a case where some of the words are erroneously recognized or omitted when the sentence is voiced, voiced sentence examples are each divided into the smallest units as semantic units (hereinafter, referred to as phrases), and in-phrase rules and inter-phrase dependency relation rules are created.

[0012] First, phrase deciding means 61 divides the bilingual voiced sentence example into phrases. In FIG. 9 (b) , bilingual phrases thus obtained are shown as a bilingual phrase (A) 71 and a bilingual phrase (B) 72.

[0013] Then, a bilingual phrase dictionary creating portion 62 creates a corresponding phrase dictionary 62 in a format where the content words in the phrases are converted into variables.

[0014] For example, the bilingual voiced sentence example 70 shown in FIG. 9 (a) comprises voiced sentence examples “Heya no yoyaku o onegai shitain desuga ()” and “I'd like to reserve a room,” and these are divided into the following two bilingual phrases: (A) “heya no yoyaku ()” and “reserve a room” as the bilingual phrase (A) 71; and (B) “onegai shitain desuga ()” and “I'd like to” as the bilingual phrase (B) 72.

[0015] The content words such as “heya (),” “yoyaku (z,10 )” and “onegai ()” are represented as variables X, Y and Z, respectively, by use of a classified vocabulary table 64 previouslycreatedasshowninFIG. 9 (e). Here, the classified vocabulary table 64 is a table listing the content words which the variables can take as their values. For example, the variable X takes a value such as “heya (),” “kaigishitsu ()” or “kuruma (),” and the content word “heya ()” is a value which the variable X can take. Therefore, the content word “heya ()” of the bilingual phrase (A) 71 is replaced with the variable X.

[0016] In this manner, two bilingual phrase rules (A) “X no Y (XY)” “YX” and (B) “Z shitain desuga (Z)” “I'd like to” are written into the bilingual phrase dictionary 62.

[0017] In order that the ordinal relations of the phrases are made rules, as shown in FIG. 9 (d) as an inter-phrase rule 63, inter-phrase relations such as “(A) o (B) ((A)(B))” “(B) (A)” are stored in the inter-phrase rule table 63. This processing is performed on all the voiced sentences in the bilingual corpus 1.

[0018] In performing interpretation, first, an original language voice is input to voice recognizing means 64. The voice recognizing means 64 outputs as a voice recognition candidate the acoustically most similar word string, for example, from among the word strings written in the bilingual phrase dictionary 62 as phrases and the word strings that can be presumed from the phrase strings written in the inter-phrase rule 63.

[0019] Language translating means 65 receives consecutive word strings recognized in this manner, converts the input consecutive word strings into phrase strings written in the bilingual phrase dictionary 62, and searches for the inter- phrase rule 63 corresponding to each phrase string. Then, the language translating means 65 converts the input original language recognition result sentence into a target language sentence based on the target language phrase equivalent to each phrase and the inter-phrase rule of the target language.

[0020] The obtained target language sentence is input to output sentence generating means 66 which corrects grammatical unnaturalness of the target language sentence. For example, the output sentence generating means 66 performs processing such as optimization of a pronoun, a verb and an auxiliary verb, for example, conversion into the third person form, the plural form or the past form, and optimization of the overall structure of the sentence. The target language translation result sentence having undergone the correction is output, for example, as a text.

[0021] However, the conventional interpretation software program has, although having an advantage that ungrammatical input sentences can be handled, a problem that since a multiplicity of different bilingual phrases and combinations thereof are written as rules as they are, the conversion rules are complicated and enormous in volume and consequently, it takes much time for the program to perform processing.

[0022] Moreover, complicated rules are necessary for the grammar check performed by the output sentence generating portion; particularly, with respect to interpolation of zero pronouns, there is no technology by which zero pronouns can completely and correctly be interpolated, and interpolation is sometimes erroneously performed.

[0023] In addition, when a partially erroneous sentence is input to the language translating portion because of a voice recognition error or the like, since language conversion is performed based on erroneous in-phrase and inter-phrase rules, a translation result conveying no intention at all is output.

SUMMARY OF THE INVENTION

[0024] An object of the present invention is, in view of the above-mentioned problems, to provide an expression converting method, an expression converting apparatus and a program being compact in structure and capable of high- speed processing.

[0025] Another object of the present invention is, in view of the above-mentioned problems, to provide an expression converting method, an expression converting apparatus and a program capable of, even when a part other than the key words of the input sentence is erroneously recognized because of a voice recognition error or the like, outputting a result correctly conveying the intention without the quality of the output sentence adversely affected.

[0026] Still another object of the present invention is, in view of the above-mentioned problems, to provide an expression converting method, an expression converting apparatus and a program capable of, even when a part of the input sentence is erroneously recognized because of a voice recognition error or the like, avoiding the conventional problem that a result not conveying the sentence meaning at all is output.

[0027] One aspect of the present invention is an expression converting method wherein for each sentence in a corpus, key words are selected from the sentence, a combination of key words that are in a co-occurrence relation is identified from among a predetermined number of combinations of key words among the selected key words, and the identified key word combination and an expression into which the sentence from which the key words are selected is converted are previously associated, and

[0028] wherein predetermined key words are selected from an input sentence, the selected key words are combined, the key word combinations and the previously identified key word combination of each sentence are compared, one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison are selected, and expressions into which the selected sentences are converted are output.

[0029] Another aspect of the present invention is an expression converting method wherein by use of classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, for each sentence in a corpus, key words are selected from the sentence, a combination of classes that are in a co-occurrence relation are identified from among a predetermined number of combinations of classes among classes to which the selected key words belong, and the identified class combination and an expression into which the sentence from which the key words are selected is converted are previously associated, and

[0030] wherein predetermined key words are selected from an input sentence, classes to which the selected key words belong are combined, the class combinations and the previously identified class combination of each sentence are compared, one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison are selected, and expressions into which the selected sentences are converted are output.

[0031] Still another aspect of the present invention is an expression converting method wherein for each sentence in a corpus, key words are selected from the sentence, a combination of key words that are in a co-occurrence relation is identified from among a predetermined number of combinations of key words among the selected key words, and the identified key word combination and an expression into which the sentence from which the key words are selected is converted are previously associated,

[0032] wherein by use of classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, the identified key word combination is associated with a class combination to thereby identify a class combination of the sentence, and

[0033] wherein predetermined key words are selected from an input sentence, classes to which the selected key words belong are combined, the class combinations and the previously identified class combination of each sentence are compared, one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison are selected, and expressions into which the selected sentences are converted are output.

[0034] Yet another aspect of the present invention is an expression converting apparatus comprising:

[0035] associating means of, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of key words that are in a co-occurrence relation from among a predetermined number of combinations of key words among the selected key words, and previously associating the identified key word combination and an expression into which the sentence from which the key words are selected is converted; and

[0036] converting means of selecting predetermined key words from an input sentence, combining the selected key words, comparing the key word combinations and the previously identified key word combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0037] Still yet another aspect of the present invention is an expression converting apparatus, wherein when the degree of similarity is high as the result of the comparison, said converting means outputs the selected expression after removing a part into which a key word is converted is removed from the selected expression, said key word belonging to the key word combination that does not coincide and not being included in the key word combination that coincides.

[0038] A further aspect of the present invention is an expression converting apparatus, wherein said expression into which the sentence is converted comprises only key words or words equivalent to the key words.

[0039] A still further aspect of the present invention is an expression converting apparatus comprising:

[0040] associating means of, by use of classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, for each sentence in a corpus, selecting key words from the sentence; identifying a combination of classes that are in a co-occurrence relation from among a predetermined number of combinations of classes among classes to which the selected key words belong, and previously associating the identified class combination and an expression into which the sentence from which the key words are selected is converted; and

[0041] converting means of selecting predetermined key words from an input sentence, combining classes to which the selected key words belong, comparing the class combinations and the previously identified class combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0042] A yet further aspect of the present invention is an expression converting apparatus comprising:

[0043] associating means of, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of key words that are in a co-occurrence relation from among a predetermined number of combinations of key words among the selected key words, and previously associating the identified key word combination and an expression into which the sentence from which the key words are selected is converted, and

[0044] by use of classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, associating the identified key word combination with a class combination to thereby identify a class combination of the sentence; and

[0045] converting means of selecting predetermined key words from an input sentence, combining classes to which the selected key words belong, comparing the class combinations and the previously identified class combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0046] A still yet further aspect of the present invention is an expression converting apparatus, wherein when the degree of similarity is high as the result of the comparison, said converting means outputs the selected expression after removing a part into which a class is converted is removed from the selected expression, said class belonging to the class combination that does not coincide and not being included in the class combination that coincides.

[0047] An additional aspect of the present invention is an expression converting apparatus, wherein said expression into which the sentence is converted comprises only class.

[0048] A still additional aspect of the present invention is a program for causing a computer to function as all or part of the following means of the expression converting apparatus:

[0049] the associating means of, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of key words that are in a co-occurrence relation from among a predetermined number of combinations of key words among the selected key words, and previously associating the identified key word combination and an expression into which the sentence from which the key words are selected is converted; and

[0050] the converting means of selecting predetermined key words from an input sentence, combining the selected key words, comparing the key word combinations and the previously identified key word combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0051] A yet additional aspect of the present invention is a program for causing a computer to function as all or part of the following means of the expression converting apparatus:

[0052] the associating means of, by use of the classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of classes that are in a co-occurrence relation from among a predetermined number of combinations of classes among classes to which the selected key words belong, and previously associating the identified class combination and an expression into which the sentence from which the key words are selected is converted; and

[0053] the converting means of selecting predetermined key words from an input sentence, combining classes to which the selected key words belong, comparing the class combinations and the previously identified class combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0054] A still yet additional aspect of the present invention is a program for causing a computer to function as all or part of the following means of the expression converting apparatus:

[0055] the associating means of, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of key words that are in a co-occurrence relation from among a predetermined number of combinations of key words among the selected key words, and previously associating the identified key word combination and an expression into which the sentence from which the key words are selected is converted, and

[0056] by use of the classing information in which key words are previously classed based on predetermined properties and each class is provided with a name, associating the identified key word combination with a class combination to thereby identify a class combination of the sentence; and

[0057] the converting means of selecting predetermined key words from an input sentence, combining classes to which the selected key words belong, comparing the class combinations and the previously identified class combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted.

[0058] Next, operations of the present invention will be described.

[0059] According to the present invention, by extracting key words from the input sentence, converting the input sentence into a standard or simplified expression sentence representative of the same meaning by use of the extracted key words, and outputting the expression sentence, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing. Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0060] Moreover, according to the present invention, by extracting, as key words, words which are some of the content words included in the input sentence or the words into which the words which are some of the content words are converted, and generating a standard or simplified expression sentence comprising a combination of the key words and the expression decided by the sentence meaning presumed from the input sentence, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing. Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0061] Moreover, according to the present invention, by extracting key words from the input sentence, presuming the sentence meaning from the co-occurrence relation between the extracted key words or the co-occurrence relation, and generating a standard or simplified expression from a combination of only words predetermined from the key words or the equivalents of the key words and the presumed intention, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing. Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0062] Moreover, according to the present invention, by previously creating an example database in which output standard or simplified expression patterns, the key word groups corresponding to the patterns and the co-occurrence relation between key word groups or the co-occurrence relation are associated, extracting a key word group from the input sentence, selecting from the example database an expression pattern including the key word group most similar to the extracted key word group, and outputting the selected sentence example, in addition to the above- mentioned effects of the present invention, expression conversion can accurately be performed faithfully to the kind, the domain and the sentence pattern of the actually input sentence.

[0063] Moreover, according to the present invention, by the expression patterns written in the example database each consisting of only key words or equivalents of the key words, the effects can be enhanced.

[0064] Moreover, according to the present invention, by the description of the expression patterns written in the example database, the key word groups and the co-occurrence relation between the key word groups or the key words that are in a co-occurrence relation being a description of classes of words including the key words, in addition to the above-mentioned effects of the present invention, even when a key word not included in:the example database is input, an appropriate sentence example can be selected, so that expression conversion capable of handling a wider variety of input sentences is enabled.

[0065] Moreover, according to the present invention, by extracting a key word group from the input sentence, presuming an input error word from the relation between the extracted key words, presuming the sentence meaning from the key words other than the key word presumed to be an input error word, and generating a standard or simplified expression from a word combination decided by the presumed sentence meaning, in addition to the effects described above in the present invention, even when a key word is erroneous, according to the degree of seriousness of the error, it is possible to convert the input sentence into an expression of which meaning can correctly be understood or to notify the user that the meaning cannot be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

BRIEF DESCRIPTION OF THE DRAWINGS

[0066] FIG. 1 is a view showing an interpreting apparatus according to a first embodiment of the present invention;

[0067] FIG. 2(a) is a view showing an example of a bilingual key word dictionary and an example of an example DB used in the first embodiment of the present invention;

[0068] FIG. 2(b) is a view showing an example of a tagged corpus used in the first embodiment of the present invention;

[0069] FIG. 3 is a view showing the example DB used in the first embodiment of the present invention;

[0070] FIG. 4 is a view showing an interpreting apparatus according to a second embodiment of the present invention;

[0071] FIG. 5(a) is a view showing a classified vocabulary table used in the second embodiment of the present invention;

[0072] FIG. 5(b) is a view showing an example DB used in the second embodiment of the present invention;

[0073] FIG. 6 is a view showing an interpreting apparatus of a third embodiment of the present invention;

[0074] FIG. 7 is a view showing an example DB used in the third embodiment of the present invention;

[0075] FIG. 8 is a view showing the structure of the conventional interpreting apparatus;

[0076] FIG. 9(a) is a view showing the example of the conventional bilingual voiced sentence examples;

[0077] FIG. 9(b) is a view showing the examples of the conventional bilingual phrases;

[0078] FIG. 9(c) is a view showing the example of the conventional bilingual phrase dictionary;

[0079] FIG. 9(d) is a view showihg the example of the conventional inter-phrase rules; and

[0080] FIG. 9(e) is a view showing the example of the conventional classified vocabulary table.

[0081] FIG. 10(a) is a view showing an example of a bilingual key word dictionary and an example of an example DB used in case of converting an English sentence into a Japanese sentence in the first embodiment of the present invention;

[0082] FIG. 10(b) is a view showing an example of a tagged corpus used in case of converting an English sentence into a Japanese sentence in the first embodiment of the present invention;

[0083] FIG. 11(a) is a view showing a classified vocabulary table used in case of converting an English sentence into a Japanese sentence in the second embodiment of the present invention;

[0084] FIG. 11(b) is a view showing an example DB used in case of converting an English sentence into a Japanese sentence in the second embodiment of the present invention;

[0085] FIG. 12 is a view showing an example DB used in case of converting an English sentence into a Japanese sentence in the third embodiment of the present invention;

[0086] FIG. 13(a) is a view showing an example of a bilingual key word dictionary and an example of an example DB used in case of converting a Chinese sentence into Japanese sentence in the first embodiment of the present invention;

[0087] FIG. 13(b) is a view showing an example of a tagged corpus used in case of converting a Chinese sentence into Japanese sentence in the first embodiment of the present invention;

[0088] FIG. 14(a) is a view showing a classified vocabulary table used in case of converting a Chinese sentence into Japanese sentence in the second embodiment of the present invention;

[0089] FIG. 14(b) is a view showing an example DB used incase of converting a Chinese sentence into Japanese sentence in the second embodiment of the present invention;

[0090] FIG. 15 is a view showing an example DB used in case of converting a Chinese sentence into Japanese sentence in the third embodiment of the present invention;

Explanation of Reference Numerals

[0091] 1 Tagged corpus

[0092] 2 Dependency relation analyzing means

[0093] 3, 11 Example database

[0094] 4 Speech recognizing means

[0095] 5 Key word extracting means

[0096] 6, 16 Bilingual key word dictionary

[0097] 7, 14 Sentence example selecting means

[0098] 8, 15 Output sentence generating means

[0099] 12 Classified vocabulary table

[0100] 13 Word classing means

[0101] 21 Erroneously recognized word presuming means

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0102] Hereinafter, embodiments of the present invention will be described with reference to the drawings.

First Embodiment

[0103] First, a first embodiment will be described. In the first embodiment, an interpreting apparatus will be described that converts an original language sentence input by voice (hereinafter, a sentence to be expression-converted will be referred to as an original language sentence) into a target language sentence in another language (hereinafter, a sentence having undergone the expression conversion will be referred to as a target language sentence).

[0104] FIG. 1 shows the structure of an interpreting apparatus according to an embodiment of the present invention.

[0105] The interpreting apparatus of this embodiment comprises a tagged corpus 1, dependency relation analyzing means 2, an example DB 3, speech recognizing means 4, key word extracting means 5, sentence example selecting means 7, output sentence generating means 8, and a bilingual key word dictionary 6.

[0106] The tagged corpus 1 is a bilingual corpus in which an intention tag is added to each of the bilingual sentences.

[0107] The dependency relation analyzing means 2 creates the example DB 3 by analyzing a co-occurrence relation between key words for each of the bilingual sentences in the tagged corpus 1.

[0108] In the example DB 3, sentence examples in which key word pairs representative of co-occurrence relations between key words of original language sentences are associated with target language sentences are stored.

[0109] The voice recognizing means 4 voice-recognizes the voice input as an original language sentence, and outputs a word string candidate.

[0110] The key word extracting means 5 receives the word string candidate output from the voice recognizing means 4, and extracts predetermined key words from the word string candidate.

[0111] The sentence example selecting means 7 compares key word pairs created by combining the key words in the input sentence with the key word pairs in each sentence example in the example DB 3, selects a sentence example the largest number of which key word pairs is included in the input sentence, as the sentence example of which meaning is most similar to that of the input sentence, and outputs the selected sentence example.

[0112] The output sentence generating means 8 corrects grammatical unnaturalness of the output sentence example, and outputs the corrected sentence example as a target language sentence, for example, in text form or by voice.

[0113] In the bilingual key word dictionary 6, key words in the original language (hereinafter, the language to be expression-converted will be referred to as the original language) and the equivalents in the target language (hereinafter, the language of the sentence having undergone the expression conversion will be referred to as the target language) for the key words are paired and stored.

[0114] Next, an operation of this embodiment structured as described above will be described.

[0115] It is necessary for the interpreting apparatus of this embodiment to do the following before performing interpretation: deciding the contents of key words and the correspondence between key word groups and target language expression patterns; writing the equivalents in the target language for the key words into the bilingual key word dictionary 6; and writing the correspondence between the key word groups and the target language expression patterns into the example DB 3.

[0116] To do so, first, for each input sentence meaning, key words representative of an intention, and expression patterns using the key words may be manually decided by the developer. Alternatively, the following may be performed: An intention tag is added to each of the bilingual sentences in a bilingual corpus, the bilingual corpus is classified according to the intention, words shared among the sentence meanings are selected as key word candidates, and key words and expression patterns are semiautomatically decided by the developer checking the key word candidates. The sentence meaning refers to a unit of one or more than one different sentences expressinga similar intention. The bilingual corpus is a database of sentence examples in which a multiplicity of bilingual sentences are stored. The bilingual sentences each comprise a sentence in the original language and a sentence in the target language associated with each other.

[0117] From the key words and the expression patterns decided by any of the above-described methods, a bilingual key word dictionary and an example DB for conversion are created. FIG. 2(a) shows an example of the bilingual key word dictionary 6 and an example of the example DB 3 used in a case where the original language is Japanese and the target language is English, that is, in a case where the interpreting apparatus of this embodiment interprets a voice in Japanese into a voice in English.

[0118] In the example of the bilingual key word dictionary 6 of FIG. 2(a), as the equivalent of a Japanese word “kohi (),” an English word “coffee” is written, and as the equivalent of a Japanese word “miruku (),” an English word “milk” is written. As the equivalent of a Japanese word “onegai ()” which cannot be represented by a single English word, “*” is written in the place where the English equivalent is to be written.

[0119] In the example of the example DB 3 of FIG. 2(a), a key word group of “kohi ()” and “onegai ()” is associated with a target language expression pattern “I'd like to coffee please.” Likewise, a key word group of “tsumetai (),” “miruku ()” and “ari ()” is associated with a target language expression pattern “Do you have a cold milk?”

[0120] Moreover, in the example of the example DB 3 of FIG. 2(a), in each key word group, the key words are paired like (kohi ()→miruku ()). Like this, the key words written in the example DB 3 are paired without exception. These key word pairs each represent a co-occurrence relation between the key words, and are created by the dependency relation analyzing means 2 in the following manner:

[0121] First, the dependency relation analyzing means 2 performs a dependency structure analysis for the original language sentence in the tagged corpus 1 to clarify the dependency structure ofeachphrase. Whenapairofkeywordsthatareinaco-occurrence relation in the dependency structure relation is present, the information thereon is added to the corresponding key words and expressionpattern pair in the example DB 3. Specifically, since key words “kohi ()” and “onegai ()” are in a co-occurrence relation for an original language sentence “kohi onegai (),” as shown in FIG. 2(a), the co-occurrence relation is added like “(kohi ()→onegai ())” where the key words are paired.

[0122] In this manner, the bilingual key word dictionary 6 and the example DB 3 as shown in FIG. 2(a) are created from the tagged corpus 1, and the co-occurrence relations between the key words are added to the example DB 3.

[0123] Next, an operation to perform interpretation by use of the example DB 3 and the bilingual key word dictionary 6 previously created in the above-described manner will be described.

[0124] In performing interpretation, first, the speech recognizing means, or the voice recognizing means 4 voice-recognizes the input original language voice, and outputs a word string candidate which is the result of the recognition. Then, the result of the recognition is input to the key word extracting means 5. For example, when the input original language voice is “Tsumetai kohi arimasuka? (?),” as a result of the voice recognition, a recognition result sentence “Tsumetai kohi arimasuka? (?)” is output to the key word extracting means.

[0125] Then, the key word extracting means 5 extracts predetermined key words from the recognition result sentence, and outputs the extracted key words to the sentence example selecting means 7. For example, from the recognition result sentence “Tsumetai kohi arimasuka? (?),” the key word extracting means 5 extracts three key words “tsumetai (),” “kohi ()” and “ari().”

[0126] Then, the sentence example selecting means 7 creates key word pairs in the input sentence by combining the key words in the input sentence that are output from the key word extracting means 5. Then, the sentence example selecting means 7 compares the key word pairs in the input sentence with the key. word pairs in each sentence example in the example DB 3, selects a sentence example including the largest number of the key word pairs in the input sentence, as the sentence example of which meaning is most similar to that of the input sentence, and outputs the selected sentence example.

[0127] For example, the three key words “tsumetai (),” “kohi ()” and “ari ()” in the input sentence are combined to create three key word pairs “(tsumetai ()→kohi ()),” “(kohi()→ari ())” and “(tsumetai ()→ari ()).”

[0128] Then, the key word pairs in the input sentence are compared with the key word pair in the first sentence example in the example DB 3. Since the key word pair in the first sentence example is “(kohi ()→onegai (),” none of the key word pairs in the input sentence is included in the first sentence example.

[0129] Then, the key word pairs in the input sentence are compared with the key word pairs in the second sentence example in the example DB 3. There are two key word pairs “(tsumetai ()→miruku ())” and “(miruku ()→ari ())” in the second sentence example. Of the key word pairs in the input sentence, “(tsumetai ()→miruku ())” and “(miruku ()→ari ())” are included although “(tsumetai ()→ari ())” is not included. Therefore, the second sentence example includes two of the key word pairs in the input sentence.

[0130] Assume that as a result of comparing the key word pairs in the input sentence with the key word pairs in all the sentence examples in the example DB 3 in this manner, the second sentence example includes the largest number of the key word pairs in the input sentence. Then, the second sentence example is selected as the sentence example of which meaning is most similar to that of the input sentence, and is output. That is, a target language expression pattern “Do you have a cold milk?” is output from the sentence example selecting means 7.

[0131] While the output sentence generating means 8 is shown in FIG. 1, it is not necessary to provide the output sentence generating means 8; the apparatus can function as an interpreting apparatus by outputting the target language expression pattern output from the sentence example selecting means 7, to the outside as it is.

[0132] As described above, according to this embodiment, the example DB 3 in which standard or simplified expression patterns of output target language sentences, the key word groups corresponding to the patterns and the co-occurrence relation between the key words are associated is previously created, key words are extracted from the input sentence, the expression pattern including the keyword group most similar to the extracted key word group is selected from the example DB 3, and the selected expression pattern is output. Consequently, even in a case where the output sentence generating means 8 is provided and the output sentence generating means 8 uses conversion rules and sentence generation rules to perform correction of grammatical unnaturalness such as: optimization of a pronoun, a verb and an auxiliary verb, for example, conversion into the third person form, the plural form or the past form; interpolation of zero pronouns; and optimization of the overall structure of the sentence, the conversion rules and the sentence generation rules canbemadecompact, sothatexpressionconversioncanbeperformed at high speed with simple processing.

[0133] Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output canbe solved. For example, evenwhena sentence such as “Tsumetai miruku etto arimasuka ()” or “Tsumetaino miruku arimasuka ()” that is erroneous in a part other than the key words such as “etto ()” or “no ()” is input, the input sentence can be converted into an expression of which meaning can correctly be understood like “Do you have a cold milk?”

[0134] While a case where only one sentence example is selected by the sentence example selecting means 3 is described in this embodiment, when more than one sentence examples have an equal degree of similarity as a result of comparing the extracted key word pairs with the key word pairs in the sentence examples in the example DB 3, the more than one sentence examples are output.

[0135] In a case where the expression patterns written in the previously created example DB 3 consist of only key words as shown in FIG. 3, it is unnecessary to provide the output sentence generating means 8, or when the output sentence generating means 8 is provided and the conversion rules and the sentence generation rules are used, the conversion rules and the sentence generation rules can further be made compact, so that a highly effective interpreting apparatus can be realized.

Second Embodiment

[0136] Next, a second embodiment will be described.

[0137] FIG. 4 shows the structure of an interpreting apparatus according to an embodiment of the present invention.

[0138] An example DB 11 is different from the example DB of the first embodiment in that key words are classed and replaced with meaning codes representative of classes.

[0139] In a classified vocabulary table 12, each key word is assigned a meaning code representative of which class the key word belongs to.

[0140] Word classing means 13 replaces the key words in the example DB 11, the pair of key words that are in a co-occurrence relation and the key words included in the conversion rules with meaning codes based on the meaning codes in the classified vocabulary table 12 to thereby class the key words, and replaces the key words extracted by key word extracting means 5 with meaning code based on the meaning codes in the classified vocabulary table to thereby class the key words.

[0141] Sentence example selecting means 14 compares the key word classes in the input sentence with the key word class pairs in each sentence example, and selects a sentence example the largest number of which key word class pairs is included in the input sentence, as the sentence example of which meaning is most similar to that of the input sentence.

[0142] Output generating means 15 returns the key word class in the selected sentence example to the input key words included in the same class, and outputs the key words.

[0143] Except these, the structure is the same as that of the first embodiment.

[0144] Next, an operation of this embodiment structured as described above will be described.

[0145] In the interpreting apparatus of this embodiment, like in the first embodiment, before interpretation is performed, the contents of key words and the correspondence between key word groups and target language expression patterns are decided and written into a bilingual key word dictionary 16 and the example DB 11. Then, the dependency relation analyzing means 2 similarly adds the pair of key words that are in a co-occurrence relation in the dependency structure relation to the expression pattern pair.

[0146] Further, the word classing means 13 replaces the key words in the example DB 11, the pair of key words that are in a co-occurrence relation and the key words included in the conversion rules with meaning codes based on the meaning codes in the classified vocabulary table 12 to thereby class the key words.

[0147] FIG. 5(a) shows an example of the classified vocabulary table 12. FIG. 5(b) shows the example DB 11.

[0148] In the classified vocabulary table 12 shown in FIG. 5(a), the meaning code of words representative of beverages such as “kohi ()” is 100, and the meaning code representative of the condition of a beverage such as “tsumetai ()” is 200. Like this, the key words in the classified vocabulary table 12 are each classed by being assigned a meaning code.

[0149] In the example of the example DB 11 shown in FIG. 5(b), using the meaning codes assigned to the key words in the classified vocabulary table 12, the key words that occur in the example of the example DB 3 of FIG. 2 described in the first embodiment are represented by meaning codes.

[0150] Next, an operation to perform interpretation by use of the example DB 11 and the bilingual key word dictionary 16 previously created in the above-described manner will be described.

[0151] In performing interpretation, first, an original language voice is input to the speech recognizing means 4, and the voice recognizing means 4 voice-recognizes the input original language voice, and outputs a word string candidate which is the result of the recognition. Then, the result of the recognition is: input to the key word extracting means 5.

[0152] Then, the key word extracting means 5 extracts predetermined key words from the recognition result sentence.

[0153] Then, the word classing means 13 replaces the extracted key words with meaning codes based on the meaning codes in the classified word meaning 12 to thereby class the key words.

[0154] Then, the sentence example selecting means 14 creates key word class pairs in the input sentence by combining the classes of the key words in the input sentence that are classed by the word classing means 13. Then, the sentence example selecting means 14 compares the key word class pairs in the input sentence with the key word class pair in each sentence example in the example DB 11, and selects a sentence example the largest number of which key word class pairs is included in the input sentence, as the sentence example of which meaning is most similar to that of the input sentence.

[0155] The output sentence generating means 15 returns the key word classes in the sentence example selected by the sentence example selecting means 14 to the input key words included in the same class, replaces the key words with the equivalents of the input key words by use of the bilingual key word dictionary 16, and outputs them.

[0156] While a case where only one sentence example is selected by the sentence example selecting means 14 is described in this embodiment, when more than one sentence examples have an equal degree of similarity as a result of comparing the pairs of the classes of the extracted key words with the class pairs in the sentence examples in the example DB 3, the more than one sentence examples are output.

[0157] As described above, according to this embodiment, the example DB 11 in which standard or simplified expression patterns of output target language sentences, the key word class groups corresponding to the patterns and the co-occurrence relation between the key word class groups are associated is previously created, key words are extracted from the input sentence, the extracted key words are classed based on the classified vocabulary table 12, the expression pattern having the key word class group most similar to the classed key word group is selected from the example DB 11, and the classed key words are returned to the original key words and output, so that the conversion rules and the sentence generation rules can be made compact like in the first embodiment and expression conversion can be performed with simple processing.

[0158] Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0159] Further, by the word classing, a correct translation result can be output for a new input sentence not included in the sentence examples, so that an interpreting apparatus can be realized that is capable of handling a multiplicity of input sentences with a small example database.

[0160] In this embodiment, before interpretation is performed, the contents of key words and the correspondence between key word groups and target language expression patterns are decided and written into the bilingual key word dictionary 16 and the example DB 11, the pair of key words that are in a co-occurrence relation in the dependency structure relation is also added to the expression pattern pair, and then, the word classing means 13 replaces the key words in the example DB 11, the pair of key words that are in a co-occurrence relation and the key words included in the conversion rules with meaning codes based on the meaning codes in the classified vocabulary table 12 to thereby class the key words. However, the present invention is not limited thereto, and the following may be performed: First, before interpretation is performed, for each sentence in the tagged corpus 1, the key words in the sentence are replaced with meaning codes by use of the classified vocabulary table 12, from among a predetermined number of combinations of meaning codes among the meaning codes, a combination of meaning codes that are in a co-occurrence relation is identified, and the identified meaning code combination and the expression into which the sentence from which the meaning codes are selected is converted are previously associated.

Third Embodiment

[0161] Next, a third embodiment will be described. Like the first and the second embodiments, the third embodiment will be described with an interpreting apparatus as an example.

[0162] FIG. 6 shows the structure of the interpreting apparatus of this embodiment.

[0163] Sentence example selecting means 21, in addition to performing the function of the sentence example selecting means 7 of the first embodiment, compares the key word pairs in the example DB 3 and the extracted key word groups, and selects a sentence example on the presumption that, of the extracted key words, a key word that forms a key word pair, based on a co-occurrence relation, with none of the other key words is an erroneously recognized word.

[0164] Except this, the structure is the same as that of the first embodiment.

[0165] Next, an operation of this embodiment structured as described above will be described.

[0166] In the interpreting apparatus of this embodiment, like in the first embodiment, before interpretation is performed, the contents of key words and the correspondence between key word groups and target language expression patterns are decided and written into the bilingual key word dictionary 6 and the example DB 3, and the co-occurrence relation between key words are also added to the example DB 3.

[0167] Next, an operation to perform interpretation will be described.

[0168] In performing interpretation, first, an original language voice is input to the speech recognizing means 4, and the voice recognizing means 4 voice-recognizes the original language voice, and outputs a word string candidate which is the result of the recognition.

[0169] The key word extracting means 5 receives the result of the recognition output from the speech recognizing means 4, and extracts predetermined key words from the recognition result sentence.

[0170] Then, the sentence example selecting means 21 compares the key word pairs in the example DB 3 and the extracted key word pairs, and presumes, of the extracted key words, a key word that forms a key word pair, based on a co-occurrence relation, with none of the other key words to be an erroneously recognized word.

[0171] Then, by use of the key words other than the key word presuned to be erroneously recognized, the sentence example selecting means 21 selects a sentence, and outputs the selected sentence example like in the first embodiment.

[0172] FIG. 7 shows an example in which although “atsui miruku ()” is input to the voice recognizing means 4, the voice recognizing means 4 erroneously recognizes it as “aol miruku (),” that is, although a voice “Atsui miruku ha arimasuka (z,58 )” is input to the voice recognizing means 4, the voice recognizing means 4 erroneously recognizes it, and outputs a recognition result sentence “Aoi miruku ha arimasuka ().”

[0173] Moreover, in the example of the example DB 3 of FIG. 7, a key word group of “kohi ()” and “onegai (),” constitutes a key word pair “(kohi ()→onegai ()),” and a target language expression pattern “Coffee please” corresponds thereto. Moreover, a key word group of “atui (),” “miruku ()” and “ari ()” includes key words pairs “(atui ()→miruku ())” and “(miruku ()→ari ()),” and a target language expression pattern “Do you have a hot milk?” corresponds thereto.

[0174] When the speech recognizing means 4 erroneously recognizes the input voice as described above, the key word extracting means 5 receives a recognition result sentence “Aoi miruku ha arimasuka (),” and extracts three key words “aoi (∇),” “miruku()” and “ari ()” as they key words.

[0175] In such a case, the sentence example selecting means 21 combines the three key words extracting means 5 to create theree key word pairs “(aoi ()→miruku ()),” “(aoi ()→ari ())” and “(miruku ()→ari (),” and compares them with the key word groups in the example DB 3.

[0176] That is, first, the key word pair “(kohi ()→onegai ())” in the first sentence example written in the example DB 3 and three key word pairs created by combining the three key words are compared. As a result of the comparison, a key word pair that coincides with the key word pair “(kohi ()→onegai ())” in the first sentence example is absent.

[0177] Then, the key word pairs “(atsui ()→miruku ())” and “(miruku ()→ari ())” in the second sentence example written in the example DB 3 and the three key word pairs created by combining the three key words are compared. As a result of the comparison, of the key word pairs created by combining the three key words, “(miruku ()→ari ())” coincides with a key word pair in the second sentence although “(aoi ()→miruku ())” and “(aoi ()→ari ())” coincide with none of the key word pairs in the second example.

[0178] That is, the key word pairs in the second sentence example in the example DB has a higher degree of similarity to the key word pairs created by combining the three key words than the key word pair in the first sentence example. Therefore, the second sentence example is selected.

[0179] Further, the second sentence example is output after the part of the target language expression corresponding to the following key word is removed from the second sentence example: the key word included in the one of the key word pairs in the second sentence that coincides with none of the key word pairs created by combining the three key words which key word is not included in the one of the key word pairs that coincides with the key word pair created by combining the three key words. That is, since “atsui ()” is such a key word, referring to the bilingual key word dictionary 6, the part of the target language expression corresponding to “atsui (),” that is, “hot” is removed from “Any hot milk?,” and a sentence “Any milk?” is output.

[0180] Thus, when the three key word pairs “(aoi ()→miruku ()),” “(aoi ()→ari ())” and “(miruku ()→ari ())” and the co-occurrence relations, that is, the key word pairs in the example DB 3 are compared, since a word that is in a co-occurrence relation with “atsui ()” is absent in the recognition result, it is presumed that “atsui ()” is erroneously recognized, and a sentence “Any milk?” is output as mentioned above.

[0181] While a case where only one sentence example is selected by the sentence example selecting means 21 is described in this embodiment, when more than one sentence examples have an equal degree of similarity as a result of comparing the extracted key word pairs with the key word pairs in the sentence examples of the example DB 3, the more than one sentence examples are output.

[0182] The function of the sentence example selecting means 21 in this embodiment is not limited to the above-described one, and may be divided into two steps of erroneous recognition selecting means and sentence selecting means. In this case, erroneously recognized word presuming means presumes an erroneously recognized word by comparing the key word group extracted from the recognition result and the key word pairs written in the example DB 3, and the sentence example selecting means has a similar function to the sentence selecting means 7 of the first embodiment.

[0183] In this case, in performing interpretation, the erroneously recognized word presuming means 21 compares the created key word pairs with the key word pairs in each sentence example of the example DB 3, and presumes, of the extracted key words, a key word that forms a key word pair, based on a co-occurrence relation, with none of the other key words to be an erroneously recognized word. Then, by use of the key words other than the key word presumed by the erroneously recognized word presuming means to be erroneously recognized, the sentence example selecting means selects a sentence example of which intention is most similar to that of the input sentence, and outputs the selected sentence example like in the first embodiment. By doing this, even when an erroneously recognized part is included in the result of the recognition by the voice recognizing means 4 as mentioned above and the voice recognizing means 4 outputs a recognition result, “Any milk?” can be output like in the previously-described case by the recognition result being processed by the erroneously recognized word presuming means and the sentence example selecting means.

[0184] It is explained in first to third embodiments above that an original language sentence is a Japanese sentence and a target language sentence is an English sentence, or that the interpreting apparatus converts a Japanese sentence into an English sentence, but an original language sentence and a target language sentence may be other language sentences.

[0185] The difference from above embodiments will be mainly described below in case of an interpreting apparatus converting an English sentence into a Japanese sentence and in case of an interpreting apparatus converting a Chinese sentence into a Japanese sentence respectively.

[0186] First, in the case of an interpreting apparatus converting an English sentence into a Japanese sentence, FIG. 2, FIG. 5, and FIG. 7 are replaced by FIG. 10, FIG. 11 FIG. 12 respectively.

[0187] That is, FIG. 10(a) is an example of a Bilingual key word dictionary 6 and an example database 3 in this case. FIG. 10(b) is an example of the tagged corpus 1 in this case.

[0188] FIG. 11(a) is an example of a classified vocabulary table 12 in this case. FIG. 11(b) is an example of an example database 11.

[0189] FIG. 12 is an example of an example database 3 in this case.

[0190] It is clear that the each above-mentioned embodiment can be applied to the case where an interpreting apparatus converts an English sentence into a Japanese sentence.

[0191] Second, in the case of a interpreting apparatus converting a Chinese sentence into a Japanese sentence, FIG. 2, FIG. 5, and FIG. 7 are replaced by FIG. 13, FIG. 14 FIG. 15 respectively.

[0192] That is, FIG. 13(a) is an example of a Bilingual key word dictionary 6 and an example database 3 in this case. FIG. 13(b) is an example of the tagged corpus 1 in this case.

[0193] FIG. 14(a) is an example of a classified vocabulary table 12 in this case. FIG. 11(b) is an example of an example database 11.

[0194] FIG. 15 is an example of an example database 3 in this case.

[0195] It is clear that the each above-mentioned embodiment can be applied to the case where an interpreting apparatus converts a Chinese sentence into a Japanese sentence.

[0196] While the dependency relation analyzing means 2 pairs key words that are in a co-occurrence relation in this embodiment, the present invention is not limited thereto; the dependency relation analyzing means 2 may pair key words that are in a dependency relation.

[0197] The tagged corpus 1 of this embodiment is an example of the corpus of the present invention. The key word pairs of this embodiment are examples of the key word combinations of the present invention. The dependency relation analyzing means 2 of this embodiment is an example of the associating means of the present invention. The key word extracting means 5 and the sentence example selecting means 7 of this embodiment are examples of the converting means of the present invention. The classified vocabulary table 12 of this embodiment is an example of the classing information of the present invention. The dependency relation analyzing means 2 and the word classing means 13 of this embodiment are examples of the associating means of the present invention. The key word extracting means 5 and the sentence example selecting means 14 of this embodiment are examples of the converting means of the present invention. The key word extracting means 5 and the sentence example selecting means 21 of this embodiment are examples of the converting means of the present invention. The meaning code of this embodiment is a example of the class name of the present invention.

[0198] Further, the predetermined number of the present invention is not limited to two which is the predetermined number in this embodiment, and it may be one. In this case, instead of combining key words into key word pairs and comparing the key word pairs, the key words are independently compared. Moreover, the predetermined number may be three. In this case, combinations of three key words are used for the comparison. Moreover, the predetermined number may be two and three. In this case, both key word pairs and combinations of three key words are used for the comparison. To sum up, it is necessary for the predetermined number of the present invention only to be a given positive integer or a plurality of different given positive integers.

[0199] Further, the expression converting apparatus of the present invention is not limited to the interpreting apparatus of this embodiment; it is necessary for the expression converting apparatus of the present invention only to be an apparatus that converts an input sentence into a different expression such as: a translating apparatus that converts an input text into a text in a different language and outputs the converted text, for example, converts an input Japanese text into an English text and outputs the English text; a sentence pattern converting apparatus that converts written language into spoken language; and a summary creating apparatus that summarizes a complicated or redundant sentence and outputs a summary.

[0200] Further, the expression converting method of the present invention is not limited to the interpreting method of this embodiment; it is necessary for the expression converting method of the present invention only to be a method of converting an input sentence into a different expression such as: translation to convert an input text into a text in a different language and output the converted text, for example, convert an input Japanese text into an English text and output the English text; sentence pattern conversion to convert written language into spoken language; and summary creation to summarize a complicated or redundant sentence and output a summary.

[0201] By previously creating the example database in which standard or simplified expression patterns of output target language sentences are associated with the key word groups corresponding to the patterns and the co-occurrence relation between the key word groups, extracting key words from the input sentence, selecting from the example database the expression pattern including the key word group most similar to the extracted key word group, and outputting the selected expression pattern as described above, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing.

[0202] Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0203] Further, even when a key word is erroneous, according to the degree of the error, it is possible to convert the input sentence into an expression of which meaning can correctly be understood or to notify the user that the meaning cannot be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0204] Further, the present invention is a program for causing a computer to perform the functions of all or some of the means (or apparatuses, devices, circuits, portions or the like) of the above-described expression converting apparatus of the present invention which program operates in cooperation with the computer.

[0205] Further, the present invention is a program for causing a computer to perform the operations of all or some of the steps (or processes, operations, actions or the like) of the above-described expression converting method of the present invention which program operates in cooperation with the computer.

[0206] Some of the means (or apparatuses, devices, circuits, portions or the like) of the present invention and some of the steps (or processes, operations, actions or the like) of the present invention mean some means of the plural means and some steps of the plural steps, respectively, or mean some functions of one means and some operations of one step, respectively.

[0207] Moreover, a computer-readable record medium on which the program of the present invention is recorded is also included in the present invention.

[0208] Moreover, a usage pattern of the program of the present invention may be such that the program is recorded on a computer-readable record medium and operates in cooperation with the computer.

[0209] Moreover, a usage pattern of the program of the present invention may be such that the program is transmitted through a transmission medium to be read by a computer and operates in cooperation with the computer.

[0210] The record medium includes ROMs. The transmission medium includes a transmission medium such as the Internet, and light, radio waves and sound waves.

[0211] As described above, the structure of the present invention may be realized either as software or as hardware.

[0212] As detailed above, according to this embodiment, by extracting key words from the input sentence, converting the input sentence into a standard or simplified expression sentence representative of the same meaning by use of the extracted key words, and outputting the expression sentence, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing. Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0213] Moreover, by extracting, as key words, words which are some of the content words included in the input sentence or the words into which the words which are some of the content words are converted, and generating a standard or simplified expression sentence comprising a combination of the key words and the expression decided by the sentence meaning presumed from the input sentence, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing.

[0214] Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0215] Moreover, by extracting key words from the input sentence, presuming the sentence meaning from the co-occurence dependency relation or the co-occurrence relation between the extracted key words, and generating a standard or simplified expression from a combination of only words predetermined from the key words or the equivalents of the key words and the presumed intention, the conversion rules and the sentence generation rules can be made compact, so that expression conversion can be performed with simple processing.

[0216] Moreover, even when a sentence that is erroneous in a part other than the key words is input, the input sentence can be converted into an expression of which meaning can correctly be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0217] Moreover, by previously creating the example DB in which output standard or simplified expression patterns, the key word groups corresponding to the patterns and the dependency relation or the co-occurrence relation between key word groups are associated, extracting a key word group from the input sentence, selecting from the example DB an expression pattern including the key word group most similar to the extracted key word group, and outputting the selected sentence example, in addition to the above-mentioned effects, expression conversion can accurately be performed faithfully to the kind, the domain and the sentence pattern of the actually input sentence.

[0218] Moreover, by the expression patterns written in the example DB each consisting of only key words or equivalents of the key words, the effects can be enhanced.

[0219] Moreover, by the description of the expression patterns written in the example DB, the key word groups and the dependency relation between the key word groups or the key words that are in a co-occurrence relation being a description of classes of words including the key words, in addition to the above-mentioned effects, even when a key word not included in the example DB is input, an appropriate sentence example can be selected, so that expression conversion capable of handling a wider variety of input sentences is enabled.

[0220] Moreover, by extracting a key word group from the input sentence, presuming an input error word from the relation between the extracted key words, presuming the sentence meaning from the key words other than the key word presumed to be an input error word, and generating a standard or simplified expression from a word combination decided by the presumed sentence meaning, in addition to the above-mentioned effects, even when a key word is erroneous, according to the degree of seriousness of the error, it is possible to convert the input sentence into an expression of which meaning can correctly be understood or to notify the user that the meaning cannot be understood, so that the conventional problem that an expression conversion result in which an erroneous part remains is output can be solved.

[0221] The present invention can provide an expression converting method, an expression converting apparatus and a program being compact in structure and capable of high-speed processing.

[0222] Moreover, the present invention can provide an expression converting method, an expression converting apparatus and a program capable of, even when a part other than the key words of the input sentence is erroneously recognized because of a voice recognition error or the like, outputting a result that correctly conveys the intention without the quality of the output sentence adversely affected.

[0223] Moreover, the present invention can provide an expression converting method, an expression converting apparatus and a program capable of, even when a part of the input sentence is erroneously recognized because of a voice recognition error or the like, avoiding the conventional problem that a result not conveying the sentence meaning at all is output.

Claims

1. An expression converting method wherein for each sentence in a corpus, key words are selected from the sentence, a combination of key words that are in a co-occurrence relation is identified from among a predetermined number of combinations of key words among the selected key words, and the identified key word combination and an expression into which the sentence from which the key words are selected is converted are previously associated, and

wherein predetermined key words are selected from an input sentence, the selected key words are combined, the key word combinations and the previously identified key word combination of each sentence are compared, one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison are selected, and expressions into which the selected sentences are converted are output, and
the predetermined key words of each sentence in the corpus are stored,
from the stored predetermined key words of each sentence in the corpus, predetermined combinations of key words are selected as having a dependency relationship and stored, and
the predetermined combinations of key words are individually selected for each sentence from the stored predetermined key words of the sentence.

2. An expression converting apparatus comprising:

associating means of, for each sentence in a corpus, selecting key words from the sentence, identifying a combination of key words that are in a co-occurrence relation from among a predetermined number of combinations of key words among the selected key words, and previously associating the identified key word combination and an expression into which the sentence from which the key words are selected is converted; and
converting means of selecting predetermined key words from an input sentence, combining the selected key words, comparing the key word combinations and the previously identified key word combination of each sentence, selecting one or more than one sentences that coincide or have a high degree of similarity as a result of the comparison, and outputting expressions into which the selected sentences are converted, and
the predetermined key words of each sentence in the corpus are stored,
from the stored predetermined key words of each sentence in the corpus, predetermined combinations of key words are selected as having a dependency relationship and stored, and
the predetermined combinations of key words are individually selected for each sentence from the stored predetermined key words of the sentence.

3. An expression converting apparatus according to claim 2, wherein when the degree of similarity is high as the result of the comparison, said converting means outputs the selected expression after removing a part of the selected expression into which a key word is converted, said key word belonging to the previously identified key word combination of the selected sentence that does not coincide with the combined key word combination and not being included in the previously identified key word combination of the selected sentence that coincide with the combined key word combination.

4. An expression converting apparatus according to claim 2, wherein said expression into which the sentence is converted comprises only key words or words equivalent to the key words.

5. A program for causing a computer to function as all or part of the expression converting apparatus of claim 2.

6. An expression converting method of claim 1 wherein the predetermined combinations of key words have a dependency relationship that includes a grammatically correct sequence of key words.

7. An expression converting apparatus of claim 2 wherein the predetermined combinations of key words have a dependency relationship that includes a grammatically correct sequence of key words.

Patent History
Publication number: 20040260533
Type: Application
Filed: Jul 12, 2004
Publication Date: Dec 23, 2004
Inventors: Yumi Wakita (Nara-shi), Kenji Matsui (Ikoma-shi)
Application Number: 10889345
Classifications
Current U.S. Class: Based On Phrase, Clause, Or Idiom (704/4)
International Classification: G06F017/28;