Patents by Inventor Tadataka Matsubayashi

Tadataka Matsubayashi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20140317137
    Abstract: The purpose of the invention is to provide a log management computer that shortens log search time while reducing log storage volume. The log management computer manages a log acquired from a log generating system that generates the log, which is an operation record. The log management computer is characterized by: extracting from a log message contained in the log, both a common portion that is common with another log message and a different portion that is different from another log message; storing the extracted common portion in common portion information of a storage area; storing the extracted different portion in different portion information of the storage area; and if a search request containing a search condition is received, searching for a log message that matches the search condition.
    Type: Application
    Filed: March 12, 2012
    Publication date: October 23, 2014
    Applicant: HITACHI, LTD.
    Inventors: Miyuki Hanaoka, Shinichi Kawamoto, Tadataka Matsubayashi
  • Patent number: 7752216
    Abstract: A retrieval apparatus 100 for searching document data comprises a document storage area 141 for storing documents to be searched and a document management table 142 for storing a data size of a document such that the data size is associated with a document ID for identifying the document. The retrieval apparatus 100 reads out from the document management table data sizes of documents to be searched, and calculates a retrieval document size by adding up the read out data sizes, and calculates an estimated time t1 taken for a retrieval process by the index scan method and an estimated time t2 taken for the retrieval process by the text scan method, based on the retrieval document size. The retrieval apparatus 100 compares the estimated times t1 and t2, and decides which method to use for a retrieval process, the index scan method or the text scan method.
    Type: Grant
    Filed: April 20, 2007
    Date of Patent: July 6, 2010
    Assignee: Hitachi, Ltd.
    Inventors: Kentaro Chigusa, Tadataka Matsubayashi, Kazutomo Ushijima, Tsuyoshi Sudo
  • Patent number: 7689545
    Abstract: In registering operation of a document to be searched for, a document identifier management table for managing a range of a document identifier stored for each page and a page identifier of the page is created, and an individual-search-server's search range management table for managing the range of the document identifier in charge of each search server is created. In searching operation of each search server of the document to be searched for, the individual-search-server's search range management table is referred to acquire a range of the allocated document identifier. For each index key forming a query term specified as a query condition, the document identifier management table is referred to to acquire the page identifier storing the document identifier of the allocated range. The searching operation is carried out by referring to a page shown by the acquired page identifier.
    Type: Grant
    Filed: July 21, 2005
    Date of Patent: March 30, 2010
    Assignee: Hitachi, Ltd.
    Inventors: Tadataka Matsubayashi, Michio Iijima, Yuichi Ogawa, Masaki Yotsutani, Shinya Yamamoto
  • Patent number: 7620614
    Abstract: The present invention realize a high speed retrieval performance in a document retrieval system referring to partial data of documents including structured data such as XML documents and electric mails, without providing further memory. The present invention includes storage means for storing documents to be retrieved onto a disk device, a calculation means for calculating an allocated capacity of the memory, and storage means for saving, onto the memory, partial data of the documents stored on the disk device by the calculated allocated capacity of the memory. The present invention also includes a first retrieval means for retrieving partial data stored on the memory, determining means for determining whether or not to retrieve the documents stored on the disk device based on the result from the first retrieval, and a second means for retrieving the documents stored on the disk device based on the result from the above determination.
    Type: Grant
    Filed: January 23, 2007
    Date of Patent: November 17, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Kazunari Sugiyama, Tadataka Matsubayashi, Katsushi Yako, Yasufumi Sato, Jugo Noda, Nobuo Kawamura
  • Patent number: 7558802
    Abstract: The technology for changing the nodes in an information retrieving system using a computer. When information items are registered by allocating to n nodes, steps are used to extract index information as a set of pairs of index keys of information items and addresses of information items, divide the index information into m (m>n) buckets and produce a partial inverted file to be closed within each of the buckets. Here, m and n are respectively integers of 1 (one) or above. When the allocation of the search-targeted ranges to the nodes is altered, the allocation to the buckets to each of the nodes is changed, and the partial inverted file of each bucket and the inverted file of the existing indexes are merged to produce new indexes, so that the indexes can be produced and updated with high speed.
    Type: Grant
    Filed: January 31, 2006
    Date of Patent: July 7, 2009
    Assignee: Hitachi, Ltd
    Inventors: Katsushi Yako, Norihiro Hara, Tadataka Matsubayashi
  • Patent number: 7440938
    Abstract: Information that individual elements (characteristic character strings) indicative of characteristics of a registered document appear in the registered document is stored in advance. When calculating similarity of the registered document, a query designated by a searcher is analyzed. The query is represented by a characteristic vector having the individual elements which take the relation between a plurality of words into consideration. Pieces of appearance information of the individual words contained in the query are counted. The counted appearance information is compared with a searching index to calculate similarity between documents.
    Type: Grant
    Filed: May 5, 2004
    Date of Patent: October 21, 2008
    Inventors: Tadataka Matsubayashi, Natsuko Sugaya, Michio Iijima, Yuichi Ogawa, Yuuki Watanabe, Shinya Yamamoto, Tsuyoshi Sudou
  • Publication number: 20080154882
    Abstract: A retrieval apparatus 100 for searching document data comprises a document storage area 141 for storing documents to be searched and a document management table 142 for storing a data size of a document such that the data size is associated with a document ID for identifying the document. The retrieval apparatus 100 reads out from the document management table data sizes of documents to be searched, and calculates a retrieval document size by adding up the read out data sizes, and calculates an estimated time t1 taken for a retrieval process by the index scan method and an estimated time t2 taken for the retrieval process by the text scan method, based on the retrieval document size. The retrieval apparatus 100 compares the estimated times t1 and t2, and decides which method to use for a retrieval process, the index scan method or the text scan method.
    Type: Application
    Filed: April 20, 2007
    Publication date: June 26, 2008
    Applicant: Hitachi, Ltd.
    Inventors: Kentaro Chigusa, Tadataka Matsubayashi, Kazutomo Ushijima, Tsuyoshi Sudo
  • Patent number: 7333983
    Abstract: Retrieval conditions inputted from a plurality of users are registered. According to the retrieval conditions, a retrieval is conducted for a text inputted. As a result of the retrieval, similarity of the text is calculated for each retrieval condition. The text is delivered to users of which the retrieval condition satisfies the similarity.
    Type: Grant
    Filed: November 24, 2003
    Date of Patent: February 19, 2008
    Assignee: Hitachi, Ltd.
    Inventors: Yasuhiko Inaba, Tadataka Matsubayashi, Katsumi Tada, Takuya Okamoto, Natsuko Sugaya, Yousuke Ushiroji
  • Publication number: 20070192274
    Abstract: The present invention realize a high speed retrieval performance in a document retrieval system referring to partial data of documents including structured data such as XML documents and electric mails, without providing further memory. The present invention includes storage means for storing documents to be retrieved onto a disk device, a calculation means for calculating an allocated capacity of the memory, and storage means for saving, onto the memory, partial data of the documents stored on the disk device by the calculated allocated capacity of the memory. The present invention also includes a first retrieval means for retrieving partial data stored on the memory, determining means for determining whether or not to retrieve the documents stored on the disk device based on the result from the first retrieval, and a second means for retrieving the documents stored on the disk device based on the result from the above determination.
    Type: Application
    Filed: January 23, 2007
    Publication date: August 16, 2007
    Inventors: Kazunari Sugiyama, Tadataka Matsubayashi, Katsushi Yako, Yasufumi Sato, Jugo Noda, Nobuo Kawamura
  • Patent number: 7231388
    Abstract: Similar document retrieving method and system for retrieving similar documents from a document database storing plural documents written in different languages with high accuracy while suppressing retrieval noise even when difference is found in the number of registered documents in dependence on the species of description languages. Statistical information concerning the registration-subjected documents is collected on a language-by-language basis upon registration thereof. Upon retrieval of documents similar to a query document, weights of words extracted from the query document are taken into account and on a language-by-language basis by referencing the statistical information.
    Type: Grant
    Filed: July 29, 2002
    Date of Patent: June 12, 2007
    Assignee: Hitachi, Ltd.
    Inventors: Tadataka Matsubayashi, Katsumi Tada, Yoshifumi Sato, Yasuhiko Inaba, Shin′ ya Yamamoto
  • Publication number: 20070100873
    Abstract: The technology for changing the nodes in an information retrieving system using a computer. When information items are registered by allocating to n nodes, steps are used to extract index information as a set of pairs of index keys of information items and addresses of information items, divide the index information into m (m>n) buckets and produce a partial inverted file to be closed within each of the buckets. Here, m and n are respectively integers of 1 (one) or above. When the allocation of the search-targeted ranges to the nodes is altered, the allocation to the buckets to each of the nodes is changed, and the partial inverted file of each bucket and the inverted file of the existing indexes are merged to produce new indexes, so that the indexes can be produced and updated with high speed.
    Type: Application
    Filed: January 31, 2006
    Publication date: May 3, 2007
    Applicant: Hitachi, Ltd.
    Inventors: Katsushi Yako, Norihiro Hara, Tadataka Matsubayashi
  • Patent number: 7200587
    Abstract: A similar document search method includes a step of extracting a characteristic word candidate as a candidate for a characteristic word from a seeds document including desired retrieval contents, a step of extracting as characteristic words of the seeds document, when the characteristic word candidate extracted by the extracting step is a compound characteristic word including a plurality of characteristic words, the compound characteristic word and constituent characteristic words included in the compound characteristic word from the characteristic word candidate, a step of calculating, according to the characteristic words extracted by the extracting step, similarity between the seeds document and a registration document, and a step of outputting as a retrieval result a result of the similarity calculated by the similarity calculating step.
    Type: Grant
    Filed: February 25, 2002
    Date of Patent: April 3, 2007
    Assignee: Hitachi, Ltd.
    Inventors: Tadataka Matsubayashi, Katsumi Tada, Yoshifumi Sato, Yasuhiko Inaba, Jugo Noda
  • Patent number: 7130849
    Abstract: A method for retrieving information from a computer system including a user terminal and a storage area, includes retrieving first information from the storage area using a first search query, the first search query having a first element and a first weight that is associated with the first element. The first search query has been formulated to retrieve target information. The first information includes at least a first data block. Second information is retrieved from the storage area using a second search query. The second search query has the first element, and a second weight that is associated with the first element. The second search query is derived from a relevance feedback provided on the first data block of the first information. An end-search criterion is provided to the user terminal. The end-search criterion provides information as to whether or not to end a first retrieval procedure for the target information.
    Type: Grant
    Filed: January 28, 2003
    Date of Patent: October 31, 2006
    Assignee: Hitachi, Ltd.
    Inventors: Takaaki Yayoi, Tadataka Matsubayashi, Yasuhiko Inaba, Yuichi Ogawa, Shinya Yamamoto, Masayuki Hamakawa
  • Publication number: 20060218137
    Abstract: Retrieval conditions inputted from a plurality of users are registered. According to the retrieval conditions, a retrieval is conducted for a text inputted. As a result of the retrieval, similarity of the text is calculated for each retrieval condition. The text is delivered to users of which the retrieval condition satisfies the similarity.
    Type: Application
    Filed: November 24, 2003
    Publication date: September 28, 2006
    Inventors: Yasuhiko Inaba, Tadataka Matsubayashi, Katsumi Tada, Takuya Okamoto, Natsuko Sugaya, Yousuke Ushiroji
  • Patent number: 7054860
    Abstract: In document retrieval having the relevance feedback function to modify a searching profile for retrieval on the basis of a user's evaluation to evaluate a search result as pertinent or impertinent, recommencement of the relevance feedback returned to a desired time is permitted. An evaluation inputted by a user, a searching profile modified by the evaluation and a search result based on the searching profile are all saved while making the correspondence between them. When a request for restoration of searching profile is made, a searching profile corresponding to an evaluation designated by the user is restored.
    Type: Grant
    Filed: October 6, 2003
    Date of Patent: May 30, 2006
    Assignee: Hitachi, Ltd.
    Inventors: Yasuhiko Inaba, Katsumi Tada, Natsuko Sugaya, Tadataka Matsubayashi, Akihiko Yamaguchi, Mikihiko Tokunaga
  • Publication number: 20060101004
    Abstract: In registering operation of a document to be searched for, a document identifier management table for managing a range of a document identifier stored for each page and a page identifier of the page is created, and an individual-search-server's search range management table for managing the range of the document identifier in charge of each search server is created. In searching operation of each search server of the document to be searched for, the individual-search-server's search range management table is referred to acquire a range of the allocated document identifier. For each index key forming a query term specified as a query condition, the document identifier management table is referred to to acquire the page identifier storing the document identifier of the allocated range. The searching operation is carried out by referring to a page shown by the acquired page identifier.
    Type: Application
    Filed: July 21, 2005
    Publication date: May 11, 2006
    Inventors: Tadataka Matsubayashi, Michio Iijima, Yuichi Ogawa, Masaki Yotsutani, Shinya Yamamoto
  • Patent number: 7039636
    Abstract: Word boundary identification operations such as morpheme analysis is performed on documents to be registered, and the top positions and the end positions of words are identified. Word boundary information is obtained based on these identification results. Search indexes are created for sub-strings of a predetermined length (n-grams) extracted from the document being registered. The search index includes document identification information as well as occurrence position information which indicates that the string is located at the n-th position from the beginning of the text data, and word boundary information for an n-gram in a document.
    Type: Grant
    Filed: June 9, 2003
    Date of Patent: May 2, 2006
    Assignee: Hitachi, Ltd.
    Inventors: Katsumi Tada, Takuya Okamoto, Natsuko Sugaya, Tadataka Matsubayashi, Yasuhiko Inaba, Yasushi Kawashimo
  • Patent number: 6865571
    Abstract: A document retrieval method using a computer program includes retrieving a first set of documents using a first query expression generated by the computer program. The first set of documents is provided to a user. An evaluation of the first set of documents is received from the user. The first query expression is changed to a second query expression generated by the computer program based on the evaluation.
    Type: Grant
    Filed: September 13, 2001
    Date of Patent: March 8, 2005
    Assignee: Hitachi, Ltd.
    Inventors: Yasuhiko Inaba, Katsumi Tada, Natsuko Sugaya, Tadataka Matsubayashi, Akihiko Yamaguchi, Mikihiko Tokunaga
  • Publication number: 20050021508
    Abstract: Information that individual elements (characteristic character rings) indicative of characteristics of a registered document appear in the registered document is stored in advance. When calculating similarity of the registered document, a query designated by a searcher is analyzed. The query is represented by a characteristic vector having the individual elements which take the relation between a plurality of words into consideration. Pieces of appearance information of the individual words contained in the query are counted. The counted appearance information is compared with a searching index to calculate similarity between documents.
    Type: Application
    Filed: May 5, 2004
    Publication date: January 27, 2005
    Inventors: Tadataka Matsubayashi, Natsuko Sugaya, Michio Iijima, Yuichi Ogawa, Yuuki Watanabe, Shinya Yamamoto, Tsuyoshi Sudou
  • Patent number: 6826567
    Abstract: A registration/search method for structured documents where correspondence data is prepared between a fixed-length-string and a string occurrence position within a structured document for all fixed-length-strings in the document and for each structured document. A list of a character and all hierarchical elements containing the character and element lengths is prepared. An occurrence frequency and an occurrence position of a search term is obtained using the plurality of fixed-length-substrings and the occurrence frequency extracting index. A search character is selected from the search term. A hierarchical element containing the search character is obtained using the character from the element length index. A length of the element corresponding to a search range is extracted using the obtained occurrence position. A matching degree for the search term is calculated from the obtained occurrence frequency of the search term and the extracted element length of the element corresponding to the search range.
    Type: Grant
    Filed: August 15, 2002
    Date of Patent: November 30, 2004
    Assignee: Hitachi, Ltd.
    Inventors: Katsumi Tada, Natsuko Sugaya, Tadataka Matsubayashi, Takuya Okamoto, Yasushi Kawashimo