Apparatus and method for extracting and sharing information
A message exchange unit receives and distributes a message in a community in which a plurality of users participates. A message memory stores the message received by the message exchange unit. An extraction rule memory stores an extraction rule applicable for the message. The extraction rule includes an extraction pattern, an extraction target and a display format. An information extraction unit extracts information from the message stored in the message memory by referring to the extraction rule. An extraction result memory stores extraction result information by the information extraction unit. An extraction result editing unit edits the extraction result information stored in the extraction result memory. A message generation unit generates another message representing editing of the extraction result information.
Latest Kabushiki Kaisha Toshiba Patents:
This application is a divisional of U.S. application Ser. No. 10/384,603, filed Mar. 11, 2003, and is based upon and claims the benefit of priority from the prior Japanese Patent Application P2002-069524, filed on Mar. 14, 2002, the entire contents of each of which are incorporated herein by reference.
FIELD OF THE INVENTIONThe present invention relates to an apparatus and method for extracting and sharing information based on message exchange through an electronic bulletin board or a mail on Internet or Intranet.
BACKGROUND OF THE INVENTIONAs a means for mutually exchanging information by a plurality of users, a mailing list using an electronic mail, an electronic bulletin board using WWW (World Wide Web) and a chat, are widely used. In the mailing list, a plurality of electronic mails are collected and collectively sent as a message to all members. In the electronic bulletin board, a space to jointly own information is set on a network, and a plurality of registered users or anonymous users can freely enter a message. In the chat or online chat room, in the same way of the electronic bulletin board, the space to jointly own information is set and a text message can be exchanged in real time.
In a communication means aiming at message exchange of many users such as the mailing list, the electronic bulletin board and the chat, a message related to a topic in which a majority of participants is interested is often exchanged. In this case, a group of users who exchange an electronic message with common purpose or interest is called “a community”.
In an information exchange system on the network, if the number of messages to exchange becomes large, a user often misses important information included in the message or it is difficult for the user to understand the contents appeared or discussed over a plurality of messages. Accordingly, a system for extracting predetermined information included in the plurality of messages and for preserving as information resource different from the plurality of messages is proposed.
For example, as a method for extracting schedule data from information of personally received electronic mail based on a rule to extract daily event and for presenting the schedule data, Japanese Patent Disclosure (Kokai) PH9-269940 “Device for Extracting Date or the like”, and “Extraction of Schedules and T0-P0 Items from E-mail Messages by Identifying Messages Structures and Using Language Expressions” (A thesis of Information Processing Society, Vol. 40, No. 10, pp. 3694-3705, 1999), are known. Furthermore, as a method for extracting date information from an information source except for the mail and for displaying the date information in a calendar format, Japanese Patent Disclosure (Kokai) P2000-215213 “Method and System for Preparing Table based on Distributed Type Documents” is known. Furthermore, in the community system, as a method for preserving document information corresponding to a plurality of messages in order to manage information apt to be embedded in the message, Japanese Patent Disclosure (Kokai) P2002-140344 “Method and Device for Document Management” is known.
In such a communication system in which a plurality of users exchange and jointly own the message, it is difficult for the user to remember which message includes important information in case of necessary the important information, and the user must repeatedly read a large number of messages in case of information to be often referred. However, in case of jointly owning a simple retrieval result, reutility is scarce because the presentation format is simple and unnecessary information often mixes.
Furthermore, important information exists in the message, in addition to date information such as a meeting program or a schedule, arranged by collecting other's comment or reply for a topic or a question, and arranged by collecting a predetermined expression or format part (such as URL (Uniform Resource Locator) accessible by WWW and mail address) included in the message.
On the other hand, if each user personally arranges the important part included in information such as the mail exchanged by communication among a plurality of users, operation efficiency is bad. In addition to this, an important event (information or status) cannot be jointly owned among the users communicating each other because important information is different for each user. Furthermore, even if collected information is jointly owned by disclosure, it is necessary for the user to check permission of inspection for the collected information or permission of reference for information (such as the mail from which the important item is extracted).
The above-mentioned problem does not occur when each user simply arranges information from personal mails. However, the above-mentioned problem occurs in the community system in which a plurality of users exchanges and jointly owns the message. Accordingly, a system for easily generating an important information extraction rule and an extraction result, and for jointly owning the extraction rule and the extraction result, is necessary.
SUMMARY OF THE INVENTIONIt is an object of the present invention to provide an apparatus and method for extracting and sharing information able to easily create the information extraction rule and to jointly own the extraction result information in the community.
According to aspects of the present invention, there is provided an apparatus for extracting and sharing information, comprising: a message exchange unit configured to receive a first message from a user and distribute the first message in a community in which a plurality of users participates; a message memory configured to store the first message received by said message exchange unit; an extraction rule memory configured to store an extraction rule applicable for the first message, the extraction rule including an extraction pattern, an extraction target and a display format; an information extraction unit configured to extract information from the message stored in said message memory by referring to the extraction rule; an extraction result memory configured to store extraction result information by said information extraction unit; an extraction result editing unit configured to edit the extraction result information stored in said extraction result memory; and a message generation unit configured to generate a second or editing message representing editing to the extraction result information.
Further in accordance with another aspect of the present invention, there is also provided a method for extracting and sharing information, comprising: receiving a first message from a user in a community in which a plurality of users participates; storing the first message received in a message memory; extracting information from the first message stored in the message memory by referring to an extraction rule, the extraction rule including an extraction pattern, an extraction target and a display format; storing extraction result information in an extraction result memory; editing the extraction result information stored in the extraction result memory; and generating a second or editing message representing editing of the extraction result information.
Further in accordance with another aspect of the present invention, there is also provided a computer program product, comprising: a computer readable program code embodied in said product for causing a computer to extract and share information, said computer readable program code having: a first program code o receive a first message from a user in a community in which a plurality of users participates; a second program code to store the first message received in a message memory; a third program code to extract information from the first message stored in the message memory by referring to an extraction rule, the extraction rule including an extraction pattern, an extraction target and a display format; a fourth program code to store extraction result information in an extraction result memory; a fifth program code to edit the extraction result information stored in the extraction result memory; and a sixth program code to generate a second or editing message representing editing of the extraction result information.
BRIEF DESCRIPTION OF THE DRAWINGS
Hereinafter, various embodiments of the present invention will be explained by referring to the drawings.
An access control unit 7 decides whether new or changed information managed in the community exists by using the community management information, and executes registration, acquirement and change processing of such information.
Message information exchanged in the community is received by a message exchange unit 2 and stored in a message memory 9 through the access control unit 7. Furthermore, the message exchange unit 2 receives a message from each member of the community, registers the message in the message memory 9 and distributes the registered message to each member of the community.
An extraction rule editing unit 4 prepares a means for editing the extraction rule. This editing result is stored in an extraction rule memory 10. The extraction rule includes “an extraction pattern”, “an extraction target” and “a display format”.
An information extraction unit 6 reads a group of messages as the extraction target from the message memory 9 at predetermined timing, extracts a part matched with the extraction pattern from each message, and stores the extraction result in an extraction result memory 11.
An extraction result editing unit 3 acquires the extraction result indicated by the user from the extraction result memory 11, and executes operation such as addition, correction or deletion for the extraction result. Furthermore, the extraction result editing unit 3 changes the extraction rule based on change of the extraction result or necessity. The modified extraction rule and modified extraction result are respectively stored in the extraction rule memory 10 and the extraction result memory 11.
If new information is added to the extraction result, a message generation unit 5 generates a message (generally referred to as an editing message) including the contents of the new information. This message is also stored in the message memory 9. In this case, it is possible to select whether the message is distributed to each member of the community.
A community information display 1 is a means for displaying information managed by the community under a control of the access control unit 7. Concretely, the community management information, the contents of the message, the extraction rule and the extraction result, are selectively displayed.
In the community management information stored in the community management information memory 8,
Furthermore, as shown in
Hereinafter, a series of access and operation which the user executes for various information in the community is explained in detail. First, when display of explanation of the community is selected (YES at step 501), the explanation of the community is displayed (step 510).
My space 1006 is a link to display a screen for setting the user himself who logs in. Home 1002 is a link to this community explanation screen itself, and its underlining is omitted. Bulletin board 1003 is a link to display a screen of messages exchanged and stored in the community. Information extraction 1004 is a link to display an operation screen of the extraction result information and the extraction rule. By selecting an editing button 1008, a screen to edit the set and the explanation sentence displayed on the community explanation screen is presented.
Next, when operation of the message is selected (YES in step 502), processing is forwarded to the operation of the message (step 520).
Next, when an operation of the extraction rule is selected (step 503 in
In the flow chart of editing of the extraction rule of
As for “extraction pattern”, a plurality of typical, patterns to be extracted are prepared in order for the user to easily set. For example, “date information” is used to extract an year-month-day part (“date”), a time part (“time”), an event part (“what”) representing what happens at the date, and a place part (“where”) representing at where the event happens in case that a vocabulary related to a date appears in the message. In this case, the extraction pattern such as “date information” is called a “class”. The year-month-day part, the time part, the event part and the place part to be concretely extracted, are called a “class attribute”. For example, the class “date information (schedule)” of the extraction pattern includes four class attributes: “which day (date)”, “what time (time)”, “what” and “where”. As for other classes “collection of links”, “collection of questions and answers”, “the minutes” and “call”, the class attribute to be extracted is respectively prepared. For example, “collection of links (Links)” includes two class attributes “link” and “description”, “collection of questions and answers (QA)” includes two class attributes “question” and “answer”, “the minutes” includes one class attribute “utterance (remark)”, “call” includes one class attribute “business (what)”. Furthermore, each class includes a basic class attribute “extraction source message (message)”. The class “message” includes the class attributes “creator (author)”, “creation date (date)”, “ID (id)” and “parent message (parent)”. Each extraction pattern of the class corresponds to an extraction module for detail pattern matching. For example, in the extraction module to extract “date information”, a function to extract various character pattern representing not only “mm month, dd day, yyy year” such as “Oct. 9, 2001” but also “mm/dd” “mm-dd-yyy” is included for extraction year-month-day part. By using this function, smooth pattern matching can be executed. Furthermore, a standard pattern to certainly acquire a part corresponding to each class attribute of the extraction pattern from the message can be regulated. For example, as for “date information”, the standard pattern shown in
Return to
In
Next, the text of message of the extraction target is acquired from the message memory 9 in
In the data format of the extraction result shown in
In an editing screen of a list of information extraction shown in
It is not always that the extraction result using the extraction rule is usually adequate information for the user. Accordingly, in order for the user to manually edit a part of the extraction result or to add new information to the extraction result while reading the extraction result, a means for editing the extraction result is prepared. Hereinafter, a processing of editing the extraction result is explained.
In the case of selecting addition or change of the entry, for example, a screen as shown in
In
In
Last, the contents of editing of the extraction result are reflected in the corresponding extraction result date (step 99 in
As mentioned-above, according to an embodiment of the present invention, in a common system in which a plurality of users exchange and jointly own messages, the information extraction rule is easily created and the extraction result information are jointly owned by each user. As a result, the communication can be promoted with easy arrangement format of the messages while each user is conscious of arrangement of important information in the messages afterwards. Furthermore, in addition to meeting program or schedule information, a part to be essentially referred or reused can be extracted from the messages as a document format.
For embodiments of the present invention, the processing of the present invention can be accomplished by one or more computer-executable programs, and this program can be realized in a computer-readable memory device.
In embodiments of the present invention, the memory device, such as a magnetic disk, a floppy disk, a hard disk, an optical disk (CD-ROM, CD-R, DVD, and so on), an optical magnetic disk (MD, and so on) can be used to store instructions for causing a processor or a computer to perform the processes described above.
Furthermore, based on an indication of the program installed from the memory device to the computer, OS (operation system) operating on the computer, or MW (middle ware software), such as database management software or network, may execute one part of each processing to realize the embodiments.
Furthermore, the memory device is not limited to a device independent from the computer. By downloading a program transmitted through a LAN or the Internet, a memory device in which the program is stored is included. Furthermore, the memory device is not limited to one. In the case that the processing of the embodiments is executed by a plurality of memory devices, a plurality of memory devices may be included in the memory device. The component of the device may be arbitrarily composed.
In embodiments of the present invention, the computer executes each processing stage of the embodiments according to the program stored in the memory device. The computer may be one apparatus such as a personal computer or a system in which a plurality of processing apparatuses are connected through the network. Furthermore, in the present invention, the computer is not limited to the personal computer. Those skilled in the art will appreciate that a computer includes a processing unit in an information processor, a microcomputer, and so on. In short, the equipment and the apparatus that can execute the functions in embodiments of the present invention using the program are generally called the computer.
Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with the true scope and spirit of the invention being indicated by the following claims.
Claims
1. An apparatus for extracting and sharing information, comprising:
- a message exchange unit configured to receive a first message from one of a plurality of users and distribute the first message to a community in which the plurality of users participate;
- a message memory configured to store the first message received by said message exchange unit;
- an extraction rule memory configured to store an extraction rule including an extraction pattern, an extraction target and a display format, the extraction pattern being used for extracting an information from the first message, the display format being used for displaying an extraction result of each extraction pattern;
- an information extraction unit configured to extract an information from the first message stored in the message memory by restricting a range of the first message with the extraction target and by matching the first message with the extraction pattern;
- an extraction result memory configured to store an extraction result information by the information extraction unit;
- an extraction result editing unit configured to display the extraction result information by applying the display format, and to edit the extraction result information; and
- a message generation unit configured to generate a second message representing editing of the extraction result information;
- wherein, when the message generation unit generates the second message, the message memory stores the second message as a reply of the first message, and
- the message exchange unit sends the second message to the one of the plurality of users.
2. The apparatus according to claim 1,
- further comprising an extraction rule editing unit configured to edit the extraction rule stored in the extraction rule memory.
3. The apparatus according to claim 2,
- when the extraction rule editing unit creates a new extraction rule,
- wherein the extraction rule editing unit selects one of a plurality of extraction rules each corresponding to a predetermined set value.
4. The apparatus according to claim 2 or 3,
- wherein the extraction rule editing unit transfers or copies the extraction rule managed by a first community to a second community.
5. The apparatus according to claim 2 or 3,
- wherein the extraction rule editing unit generates a new extraction rule by unifying a plurality of extraction rules.
6. The apparatus according to any of claim 1, 2 or 3,
- further comprising a community information display configured to present an information of the community to the plurality of users;
- when the community information display presents a third message as an applicable object of the extraction rule in the community,
- wherein the community information display presents a list of extraction rules applied to the third message.
7. A method for extracting and sharing information, comprising:
- receiving a first message from one of a plurality of users in a community in which the plurality of users participate;
- storing the first message received in a message memory;
- extracting an information from the first message stored in the message memory by referring to an extraction rule, the extraction rule including an extraction pattern, an extraction target and a display format, the extraction pattern being used for matching with the first message, the extraction target being used for restricting a range of the first message, the display format being used for displaying an extraction result of each extraction pattern;
- storing an extraction result information from the first message in an extraction result memory;
- displaying the extraction result information by applying the display format;
- editing the extraction result information;
- generating a second message representing editing of the extraction result information;
- storing the second message as a reply of the first message in the message memory; and
- sending the second message to the one of the plurality of users.
8. A computer program product, comprising:
- a computer readable program code embodied in the product for causing a computer to extract and share information, the computer readable program code having:
- a first program code to receive a first message from one of a plurality of users in a community in which the plurality of users participate;
- a second program code to store the first message received in a message memory;
- a third program code to extract an information from the first message stored in the message memory by referring to an extraction rule, the extraction rule including an extraction pattern, an extraction target and a display format, the extraction pattern being used for matching with the first message, the extraction target being used for restricting a range of the first message, the display format being used for displaying an extraction result of each extraction pattern;
- a fourth program code to store an extraction result information from the first message in an extraction result memory;
- a fifth program code to display the extraction result information by applying the display format;
- a sixth program code to edit the extraction result information;
- a seventh program code to generate a second message representing editing of the extraction result information;
- an eighth program code to store the second message as a reply of the first message in the message memory; and
- a ninth program code to send the second message to the one of the plurality of users.
Type: Application
Filed: Dec 11, 2007
Publication Date: May 1, 2008
Applicant: Kabushiki Kaisha Toshiba (Tokyo)
Inventors: Hideo Umeki (Kanagawa-ken), Mika Fukui (Tokyo), Kazuyuki Gotoh (Kanagawa-ken), Takehiko Yokota (Tokyo), Sougo Tsuboi (Kanagawa-ken)
Application Number: 12/000,238
International Classification: G06F 3/00 (20060101);