Sequential Access, e.g., String Matching, etc. (EPO) Patents (Class 707/E17.039)
  • Patent number: 9996603
    Abstract: Techniques are provided for automatically detecting homologies between documents based on structural characteristics. Various statistics relating to the COS structure of a PDF document are compiled. The statistics are input into a rolling hash function to generate a digital fingerprint of the document. Fingerprints from two similar documents will have small edit distances between them, and can therefore be classified similarly or provided as results to a fingerprint-based search. For example, an unclassified document may be classified in the same class as a representative document where the fingerprints of the two documents have a small edit distance between them. Since the structure of the document is used instead of the text content or renderings, it is possible to operate on encrypted documents. Further, representative elements of a particular class of documents can be selected for comparison against a target document for improved resolution of the results.
    Type: Grant
    Filed: October 14, 2014
    Date of Patent: June 12, 2018
    Assignee: Adobe Systems Inc.
    Inventor: Kapil Khattar
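The classification step in this abstract (assign an unclassified document to the class whose representative fingerprint is closest in edit distance) can be sketched as follows. This is only an illustration: the abstract does not give the rolling hash or the statistics it consumes, so fingerprints are treated here as opaque strings, and the names `edit_distance`, `classify`, and `max_distance` are hypothetical.

```python
from typing import Dict, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed one table row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        prev = curr
    return prev[-1]


def classify(fingerprint: str,
             representatives: Dict[str, str],
             max_distance: int) -> Optional[str]:
    """Return the class with the nearest representative fingerprint,
    or None when no representative is within max_distance edits."""
    best_class, best_dist = None, max_distance + 1
    for cls, rep in representatives.items():
        d = edit_distance(fingerprint, rep)
        if d < best_dist:
            best_class, best_dist = cls, d
    return best_class
```

The threshold `max_distance` is an assumed tuning parameter; the abstract only says the distance should be "small".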
  • Patent number: 9934308
    Abstract: In an embodiment, a method comprises performing a first search based on a first text string; based on the first search, identifying one or more search results; after identifying the one or more search results, causing the one or more search results to be displayed; while the one or more search results are displayed: receiving input that changes the first text string to a modified text string; in response to receiving the input: performing a second search based, at least in part, on a portion of the modified text string, and determining that no search results are identified based on the second search; after determining that no search results are identified based on the second search, continuing to display the one or more search results; wherein the method is performed by one or more computing devices.
    Type: Grant
    Filed: January 28, 2015
    Date of Patent: April 3, 2018
    Assignee: Quotient Technology Inc.
    Inventor: Jeffrey Roloff
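The behavior described in this abstract (keep showing the earlier results when a modified query returns nothing) might look like the sketch below. The class `SearchBox`, its `on_input` method, and the prefix-matching search function used in the usage note are hypothetical stand-ins, not taken from the patent.

```python
from typing import Callable, List


class SearchBox:
    """Holds the result list currently shown to the user."""

    def __init__(self, search: Callable[[str], List[str]]):
        self.search = search          # assumed search backend: query -> results
        self.displayed: List[str] = []

    def on_input(self, text: str) -> List[str]:
        """Run a search for the modified text; replace the displayed
        results only if the new search found something."""
        results = self.search(text)
        if results:
            self.displayed = results
        # An empty result set leaves the previous results on screen.
        return self.displayed
```

For example, with a backend that matches prefixes against `["coupon", "coupe"]`, typing `coup` shows both entries, and a further keystroke producing `coupx` still shows both.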
  • Patent number: 9582481
    Abstract: Method and apparatus for managing versions of a portion of a document on a computing system are described. Multiple versions of an object in the document are maintained on the computing system. A graphical representation of at least two versions of the object is displayed on a display to a user of the computing system. A previous version of the object is selected from the at least two versions of the object through an input device, and the object in the document is restored to the selected previous version. The object is a portion of the document less than the entirety of the document.
    Type: Grant
    Filed: September 29, 2012
    Date of Patent: February 28, 2017
    Assignee: APPLE INC.
    Inventors: Alan Chu, Wing Hei Law
  • Patent number: 9465651
    Abstract: A remote processor interacts with a transactional memory that has a memory, local BWC (Byte-Wise Compare) resources, and local NFA (Non-deterministic Finite Automaton) engine resources. The processor causes a byte stream to be transferred into the transactional memory and into the memory. The processor then uses the BWC circuit to find a character signature in the byte stream. The processor obtains information about the character signature from the BWC circuit, and based on the information uses the NFA engine to process the byte stream starting at a byte position determined based at least in part on the results of the BWC circuit. From the time the byte stream is initially written into the transactional memory until the time the NFA engine completes, the byte stream is not read out of the transactional memory.
    Type: Grant
    Filed: January 9, 2014
    Date of Patent: October 11, 2016
    Assignee: Netronome Systems, Inc.
    Inventors: Gavin J. Stark, Steven W. Zagorianakos
  • Patent number: 9460127
    Abstract: Technologies and implementations for allocating datacenter resources are generally disclosed.
    Type: Grant
    Filed: December 3, 2014
    Date of Patent: October 4, 2016
    Assignee: EMPIRE TECHNOLOGY DEVELOPMENT LLC
    Inventor: Ezekiel Kruglick
  • Publication number: 20150117796
    Abstract: Systems, methods, and machine-readable media for prioritizing points of interest for display in a map have been described. In certain aspects a system may include an interface module, a search module, and a priority module. The interface module may be configured to obtain pattern recognition data and location data associated with an image of a geographical area. The search module may be configured to access a set of point of interest listings and determine whether the pattern recognition data and the location data associated with the image correspond with a point of interest listing in the set of point of interest listings. The priority module may be configured to prioritize the point of interest listing for display in a map if a point of interest listing in the set of point of interest listings is determined to correspond.
    Type: Application
    Filed: March 5, 2012
    Publication date: April 30, 2015
    Applicant: Google Inc.
    Inventors: Harlan Gerhardt Hile, Nicholas John Lee
  • Patent number: 8805095
    Abstract: A method for analyzing a character string, the method including: analyzing a character string to determine one or more characters of the character string; determining from a dictionary source, an alternative character string to the analyzed character string; comparing the analyzed character string with the alternative character string to determine a weighting factor for each of the characters of the analyzed character string relative to the positional arrangement of the characters in the alternative character string; and for each determined weighting factor, generating for each of the characters in the analyzed character string a corresponding character of a particular size as determined by the weighting factor.
    Type: Grant
    Filed: September 27, 2011
    Date of Patent: August 12, 2014
    Assignee: International Business Machines Corporation
    Inventor: Flemming Boegelund
  • Patent number: 8782082
    Abstract: One embodiment relates to a computer-implemented method for multiple-keyword matching performed using a computer including at least a processor, data storage, and computer-readable instructions. A keyword set and a text input to be searched are obtained. The keyword set is processed to create a reverse trie. A search procedure which starts from the end of the text is then applied using the reverse trie to find keyword occurrences in the text input. Other embodiments, aspects, and features are also disclosed.
    Type: Grant
    Filed: November 7, 2011
    Date of Patent: July 15, 2014
    Assignee: Trend Micro Incorporated
    Inventors: Qiuer Xu, Liwei Ren
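A reverse trie for multiple-keyword matching can be illustrated with the naive sketch below: keywords are inserted back to front, and the text is scanned from its end, walking the trie leftward from each candidate end position. The abstract does not say whether the patented procedure skips positions, so no shift heuristic is attempted here; the function names and the `"$end"` marker key are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

END = "$end"  # marker key; cannot clash with single-character edge labels


def build_reverse_trie(keywords: Iterable[str]) -> Dict:
    """Insert each keyword with its characters in reverse order."""
    root: Dict = {}
    for word in keywords:
        node = root
        for ch in reversed(word):
            node = node.setdefault(ch, {})
        node[END] = word
    return root


def find_keywords(text: str, keywords: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (start_index, keyword) pairs, scanning from the end of text."""
    trie = build_reverse_trie(keywords)
    matches = []
    for end in range(len(text) - 1, -1, -1):
        node, pos = trie, end
        # Walk leftward through the text while the trie has a matching edge.
        while pos >= 0 and text[pos] in node:
            node = node[text[pos]]
            pos -= 1
            if END in node:
                matches.append((pos + 1, node[END]))
    return matches
```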
  • Publication number: 20140129569
    Abstract: Systems and methods to extract a string from a communication. A method includes: receiving a communication comprising a plurality of strings; assigning a score to each of the strings, wherein the score assigned to each of the strings corresponds to a frequency of usage of the respective string for a first function relative to an overall frequency of usage of the respective string; determining a respective total sum for each of a plurality of sequences in the communication, the respective total sum determined as a sum of the scores for each string in the respective sequence; and extracting a first sequence of the sequences from the communication based on the total sum for the first sequence. In one embodiment, the total sum includes an additional score for each of a starting word and an ending word of the first word sequence, wherein each respective additional score is associated with a probability that the starting (or ending) word is used as the first (or last word) of an address.
    Type: Application
    Filed: November 2, 2012
    Publication date: May 8, 2014
    Applicant: XOBNI CORPORATION
    Inventors: Richard Earle Seeger, III, Peter Monaco
  • Publication number: 20140122509
    Abstract: A system, method, and computer program product are provided for performing a string search. In use, a first string and a second string are identified. Additionally, a string search is performed, utilizing the first string and the second string.
    Type: Application
    Filed: November 1, 2012
    Publication date: May 1, 2014
    Applicant: NVIDIA CORPORATION
    Inventors: Jacopo Pantaleoni, David Tarjan
  • Publication number: 20140040261
    Abstract: A method for submatch extraction may include receiving an input string, receiving a regular expression, and converting the regular expression with capturing groups into a plurality of finite automata to extract submatches. The method further includes using a first automaton to determine whether the input string is in a language described by the regular expression, and to process the input string, and using states of the first automaton in a second automaton to extract the submatches. In addition, input partitioning and automaton minimization techniques may be employed to reduce the storage area consumed by the plurality of finite automata.
    Type: Application
    Filed: July 31, 2012
    Publication date: February 6, 2014
    Inventors: William G. Horne, Prasad V. Rao
  • Publication number: 20140032480
    Abstract: In some embodiments, a method includes receiving a command to update a form component within a set of one or more form templates. The method also includes receiving an alternative form component for the form component that is to be updated. The method includes performing a pattern matching across a number of form templates to identify the set of one or more form templates in the number of form templates. The method includes refactoring the set of one or more form templates. The refactoring includes changing a reference in the set of one or more form templates to reference the alternative form component.
    Type: Application
    Filed: September 4, 2007
    Publication date: January 30, 2014
    Inventor: Daniel Lesage
  • Publication number: 20140006456
    Abstract: An embodiment for evaluating rules pertaining to an event includes providing a rule trie comprising multiple nodes, wherein match conditions must be met to trigger a given rule, and each match condition corresponds to a different node. An attribute trie is provided for a string attribute, which includes a node sequence having a value. The string attribute is extracted from the event and also has a value. The value of the extracted string attribute and the value of the node sequence are compared, by traversing a path extending along the node sequence. Responsive to determining that the values of the extracted string attribute and the node sequence are the same, notice is provided that the match condition of a particular node of the rule trie has been met by an attribute of the received event.
    Type: Application
    Filed: August 7, 2012
    Publication date: January 2, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Arun Ramakrishnan, Rohit Shetty
  • Publication number: 20130318098
    Abstract: Website and newsletter operators may only have users' email addresses but not their person names. It can be beneficial for such operators to derive additional information about the email address owners so they can improve their services. However, there is no easy way to retrieve owners' information based on only email addresses. The present disclosure provides information such as the owners' names, professional profiles and aggregated statistics with only a list of email addresses as the input.
    Type: Application
    Filed: May 24, 2012
    Publication date: November 28, 2013
    Inventors: Stephen Chan, Edbert Chan
  • Publication number: 20130311495
    Abstract: A regular expression matching hardware implementation comprises two tightly coupled hardware engines. A regex engine performs state transitions and accepts (as matching) or rejects (as not matching) an input string. The regex engine also takes care of the logic of the operators and deals with the complexity of the state machine. A lookup engine reads characters from an input (e.g., tape, memory, network packets, or the like), and provides them to the regex engine. A preprocessing procedure transforms a regular expression into a regex state transition table and a lookup table, for use by the regex engine and lookup engine, respectively. The two hardware engines are synchronized by a global state machine. The regex engine advances the state machine, and the lookup engine reads it.
    Type: Application
    Filed: May 15, 2012
    Publication date: November 21, 2013
    Inventor: Frederic Rossi
  • Publication number: 20130262439
    Abstract: A method includes identifying at least one document associated with content from at least one digital content source. The at least one document includes information identifying at least one aspect of the content. The method also includes determining a document index for the at least one document based on keywords included in the at least one document. An activity field is inserted into the document index. The method includes accessing activity information. The activity information identifies at least one activity keyword associated with at least one activity. The method further includes identifying at least one present activity keyword in the document based on the activity information. An indicator of at least one present activity is determined based on the at least one present activity keyword. The method includes indexing the indicator of the at least one present activity in the activity field of the document index.
    Type: Application
    Filed: March 27, 2012
    Publication date: October 3, 2013
    Applicant: VERIZON PATENT AND LICENSING INC.
    Inventors: Jack Jianxiu Hao, Zhiying Jin, Martin Busse, Jimena Velarde
  • Publication number: 20130246433
    Abstract: A record-pattern expression of a record pattern of data records is translated to a character-pattern expression of a character pattern. A data-record series of data records is converted to a character string in which each character in the character string corresponds to a respective data record in the data-record series. Pattern instances of the character pattern in the character string are searched to identify ordinal character positions in the character string of characters in the character pattern instances. Data records at ordinal record positions in the record series of data records corresponding to the ordinal character positions are reported.
    Type: Application
    Filed: March 15, 2012
    Publication date: September 19, 2013
    Inventors: Matthew Steven Fuller, Mingsheng Hong, Hongmin Fan, Andrew Allinson Lamb, William F. Mann, Lakshmikant Shrinivas
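The idea of mapping each record to one character and then searching the resulting string can be sketched as below. The translation from a record-pattern expression to a character pattern is assumed to have been done by hand, and `find_record_patterns` and the `classify` callback are hypothetical names; `classify` must return exactly one character per record so that character positions equal record positions.

```python
import re
from typing import Callable, List, Sequence, TypeVar

R = TypeVar("R")


def find_record_patterns(records: Sequence[R],
                         classify: Callable[[R], str],
                         char_pattern: str) -> List[List[int]]:
    """Encode each record as one character, search the string with a
    regular expression, and report the matching record positions."""
    encoded = "".join(classify(r) for r in records)
    assert len(encoded) == len(records), "classify must yield one char per record"
    return [list(range(m.start(), m.end()))
            for m in re.finditer(char_pattern, encoded)]


def direction(change: float) -> str:
    """Example classifier: U for a rise, D for a fall, F for flat."""
    return "U" if change > 0 else "D" if change < 0 else "F"
```

With price changes `[1, 2, -1, 0, 3, 4, 5, -2]` the encoded string is `UUDFUUUD`, and the pattern `U{2,}D` (two or more rises followed by a fall) selects the record positions of each such run.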
  • Publication number: 20130232163
    Abstract: In an embodiment, a method comprises performing a first search based on a first text string; based on the first search, identifying one or more search results; after identifying the one or more search results, causing the one or more search results to be displayed; while the one or more search results are displayed: receiving input that changes the first text string to a modified text string; in response to receiving the input: performing a second search based, at least in part, on a portion of the modified text string, and determining that no search results are identified based on the second search; after determining that no search results are identified based on the second search, continuing to display the one or more search results; wherein the method is performed by one or more computing devices.
    Type: Application
    Filed: March 5, 2012
    Publication date: September 5, 2013
    Inventor: Jeffrey Roloff
  • Publication number: 20130204895
    Abstract: A method of operation of a navigation system includes: detecting a search string; determining a forward-element using a forward-boundary within the search string in a forward direction; determining a backward-element in a backward direction up to the forward-boundary using a backward-boundary within the search string; and determining a matching result for the search string using the forward-element and the backward-element for displaying on a device.
    Type: Application
    Filed: February 6, 2012
    Publication date: August 8, 2013
    Applicant: TELENAV, INC.
    Inventor: BoSen Liao
  • Patent number: 8495101
    Abstract: An information processing method for defining a data structure for pattern matching, the method executed by an information processing apparatus, comprises generating, by the apparatus, an ordered tree structure by defining transition edges between nodes using, as transition conditions, respective constraints from one or more constraint patterns each including plural constraints; searching, by the apparatus, for a second substructure similar to a first substructure from a root node by determining a set relation between transition conditions of respective transition edges; and defining, by the apparatus, an additional transition link from a tail node of the second substructure to a child node at a tail end of the first substructure, the additional transition link adding a constraint to be met by an indeterminant identified from the set relation.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: July 23, 2013
    Assignee: International Business Machines Corporation
    Inventors: Teruo Koyanagi, Yuta Tsuboi
  • Patent number: 8495075
    Abstract: A method accomplishes matching a particular media metadata record to one of a plurality of records of a first media metadata database. For the particular media metadata record, for each of at least some of the tokens of the particular media metadata record, a table is processed that correlates that token to records of the first media metadata database. Based on the processing, it is determined in which of the plurality of records (match candidate records) of the first media metadata database that token of the particular record appears. For each of at least some of the match candidate records of the first media metadata database, a value is determined that indicates a degree of match between that match candidate record and the particular media metadata record. The determined values are processed to determine that one of the match candidate records is a nominal match to the particular media metadata record.
    Type: Grant
    Filed: March 8, 2006
    Date of Patent: July 23, 2013
    Assignee: Apple Inc.
    Inventor: Mark Miller
  • Publication number: 20130179468
    Abstract: A system and method for searching a product model number database having a plurality of product model numbers stored therein receives a string to be searched in the product model number database from a client device. Multiple queries are then created based on the string and the model number database is queried using the created queries. Query results responsive to the created queries are then received from the model number database and transmitted to the client device.
    Type: Application
    Filed: January 10, 2012
    Publication date: July 11, 2013
    Applicant: W.W. Grainger, Inc.
    Inventor: Geoffry A. Westphal
  • Publication number: 20130151538
    Abstract: An entity summarization system is described herein that mines the Internet and other data sources to provide answers to questions such as the relative sentiment of users towards various brands. The system uses a controlled vocabulary list describing a specific aspect of entities of interest. Given an entity name, the system scans the whole content corpus to collect statistics on the words that occur most frequently in the context of the entity name, taking into account proximity information, to produce a weighted list of vocabulary terms describing the entity. Two entities can be compared by normalizing and comparing their weighted term lists. In some embodiments, the system performs these procedures efficiently by leveraging an N-gram web model. Thus, the system provides an automated way to compare two entities to derive information about how users feel about the entities at any given time.
    Type: Application
    Filed: December 12, 2011
    Publication date: June 13, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Pavel Dmitriev, Wei Zhuang
  • Publication number: 20130124493
    Abstract: Techniques for performing searches using models generated from data mining. These techniques may include receiving a query including a search string. If a portion of the search string matches with a category of relationship data, the portion of the search string may be replaced with a label of the category to generate a modified query. Based on the modified query and models generated from mining historical search data, intent of the query may be determined, and searches may be then performed.
    Type: Application
    Filed: November 14, 2012
    Publication date: May 16, 2013
    Applicant: Alibaba Group Holding Limited
    Inventor: Alibaba Group Holding Limited
  • Patent number: 8407261
    Abstract: An information processing apparatus according to the present invention comprises a generation section configured to generate an ordered tree structure by defining transition edges between nodes using, as transition conditions, respective constraints from one or more constraint patterns each including plural constraints, a search section configured to search for a second substructure K similar to a first substructure P from a root node by determining a set relation between transition conditions of respective transition edges, and a transition defining section configured to define an additional transition link from a tail node sKn of the second substructure K to a child node sPn+1 of the first substructure P, the additional transition link adding a constraint to be met by an indeterminant identified from the set relation, wherein a data structure for pattern matching is defined.
    Type: Grant
    Filed: June 30, 2009
    Date of Patent: March 26, 2013
    Assignee: International Business Machines Corporation
    Inventors: Teruo Koyanagi, Yuta Tsuboi
  • Publication number: 20130066898
    Abstract: One or more techniques and/or systems are disclosed for matching a target string to a known string. A target string is broken into one or more target terms, and the one or more target terms are matched to known terms in an index tree. The index tree comprises one or more known terms from a plurality of known strings, where the respective known terms in the index tree are associated with one or more known string IDs. A known term that is associated with a known string ID (in the index tree, and to which a target term is matched), is comprised in a known string, which corresponds to the known string ID. The target string can be matched to the known string using the known string's corresponding known string ID that is associated with a desired number of occurrences in the matching of the one or more target terms.
    Type: Application
    Filed: September 9, 2011
    Publication date: March 14, 2013
    Applicant: Microsoft Corporation
    Inventor: Enyuan Wu
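The term-to-string-ID matching described in this abstract can be approximated with the sketch below. A plain dictionary stands in for the "index tree", which is a deliberate simplification, and whitespace tokenization, the `min_hits` threshold, and the function names are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from typing import Dict, Hashable, Optional, Set


def build_index(known_strings: Dict[Hashable, str]) -> Dict[str, Set[Hashable]]:
    """Map every term to the IDs of the known strings that contain it."""
    index: Dict[str, Set[Hashable]] = defaultdict(set)
    for string_id, text in known_strings.items():
        for term in text.lower().split():
            index[term].add(string_id)
    return index


def match(target: str,
          index: Dict[str, Set[Hashable]],
          min_hits: int) -> Optional[Hashable]:
    """Return the known string ID hit by the most target terms,
    provided it was hit at least min_hits times; otherwise None."""
    hits: Counter = Counter()
    for term in set(target.lower().split()):
        for string_id in index.get(term, ()):
            hits[string_id] += 1
    if not hits:
        return None
    string_id, count = hits.most_common(1)[0]
    return string_id if count >= min_hits else None
```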
  • Publication number: 20130060799
    Abstract: Systems and methods are disclosed for comparing selections of text to show differences between the two selections. The text may be selected from the same source or from two different sources. In one implementation, a system receives a first selection of text for comparison and places the selection in a first buffer. The system receives a second selection of text for comparison and places the second selection in a second buffer. The system compares the first buffer and the second buffer to determine differences and displays the differences. In some embodiments, the system may allow a user to choose two buffers from among a plurality of buffers for comparison.
    Type: Application
    Filed: September 1, 2011
    Publication date: March 7, 2013
    Inventor: Deepak Massand
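Comparing two buffers chosen from several can be sketched with Python's standard `difflib` module. The buffer names and the `diff_buffers` helper are hypothetical, and line-level comparison is an assumption; the abstract does not say at what granularity differences are computed.

```python
import difflib
from typing import Dict, List, Tuple


def diff_buffers(buffers: Dict[str, str],
                 first: str,
                 second: str) -> List[Tuple[str, str]]:
    """Compare two named buffers line by line and return the differences
    as ("-", line) for removed lines and ("+", line) for added lines."""
    a = buffers[first].splitlines()
    b = buffers[second].splitlines()
    changes: List[Tuple[str, str]] = []
    matcher = difflib.SequenceMatcher(None, a, b)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("delete", "replace"):
            changes += [("-", line) for line in a[i1:i2]]
        if tag in ("insert", "replace"):
            changes += [("+", line) for line in b[j1:j2]]
    return changes
```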
  • Publication number: 20130060786
    Abstract: A method and system are disclosed for conducting text-based searches of images using a visual signature associated with each image. A measure of string similarity between a query and an annotation associated with each entry in a first database is computed, and based upon the computed string similarity measures, a set of entries from the first database is selected. Each entry of the first database also includes an associated visual signature. At least one entry is then retrieved from a second database based upon a measure of visual similarity between a visual signature of each of the entries in the second database and the visual signatures of the entries in the selected set. Information corresponding to the retrieved entries from the second database is then generated.
    Type: Application
    Filed: September 2, 2011
    Publication date: March 7, 2013
    Applicant: Xerox Corporation
    Inventors: José Antonio Rodriguez Serrano, Florent Perronnin, Craig John Saunders
  • Publication number: 20130054552
    Abstract: A spatial and temporal memory system (STMS) processes input data to detect whether spatial patterns and/or temporal sequences of spatial patterns exist within the data, and to make predictions about future data. The data processed by the STMS may be retrieved from, for example, one or more database fields and is encoded into a distributed representation format using a coding scheme. The performance of the STMS in predicting future data is evaluated for the coding scheme used to process the data as performance data. The selection and prioritization of STMS experiments to perform may be based on the performance data for an experiment. The best fields, encodings, and time aggregations for generating predictions can be determined by an automated search and evaluation of multiple STMS systems.
    Type: Application
    Filed: August 25, 2011
    Publication date: February 28, 2013
    Applicant: NUMENTA, INC.
    Inventors: Jeffrey C. Hawkins, Ronald Marianetti, II, Anosh Raj, Subutai Ahmad
  • Publication number: 20130018911
    Abstract: Methods and systems for obtaining a scalable variable-length nonce are provided. An exemplary method for obtaining a scalable variable-length nonce includes obtaining an initial nonce candidate. The exemplary method also includes transmitting a query to determine whether the initial nonce candidate has already been assigned. When the initial nonce candidate has already been assigned, a combined nonce candidate is obtained. The combined nonce candidate includes characters appended to the initial nonce candidate. An exemplary system for obtaining a scalable variable-length nonce includes a transmitting engine that transmits a request associated with a current token. The exemplary system also includes a receiving engine that receives a message associated with the current token. The exemplary system further includes a token acquiring engine that obtains a new token, and an appending engine that appends the new token to the current token.
    Type: Application
    Filed: July 14, 2011
    Publication date: January 17, 2013
    Applicant: Google Inc.
    Inventor: Likai Liu
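The nonce-extension loop from this abstract (query whether a candidate is taken, and if so append more characters) can be sketched as below. The alphabet, the initial length, the step size, and the `is_assigned` callback that stands in for the transmitted query are all assumptions for illustration.

```python
import secrets
import string
from typing import Callable

ALPHABET = string.ascii_lowercase + string.digits


def obtain_nonce(is_assigned: Callable[[str], bool],
                 initial_length: int = 4,
                 step: int = 2,
                 pick: Callable[[str], str] = secrets.choice) -> str:
    """Draw an initial candidate; while it is already assigned, append
    `step` more characters to form a longer combined candidate."""
    candidate = "".join(pick(ALPHABET) for _ in range(initial_length))
    while is_assigned(candidate):
        candidate += "".join(pick(ALPHABET) for _ in range(step))
    return candidate
```

Because a collision lengthens the candidate rather than discarding it, nonces stay short while the namespace is sparse and grow only as it fills up.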
  • Publication number: 20120330994
    Abstract: Systems and methods are disclosed for identifying objects from a collection of objects of diverse types that match a pattern. In one implementation, a system includes a processor and a memory. The memory stores instructions that cause the processor to receive a pattern input text identifying the pattern to be matched. The pattern input text comprises one or more conditions and a match statement. The processor also receives a collection of objects of diverse types, where each object comprises at least an attribute and a value associated with the attribute. The processor analyzes each object to determine whether the object satisfies the match statement based on an application of the one or more conditions to the attributes and attribute values for the object. If the processor determines that the object satisfies the match statement, the processor creates an output comprising an indication of the object and the conditions that caused the match statement to be satisfied.
    Type: Application
    Filed: June 22, 2011
    Publication date: December 27, 2012
    Inventor: Trevor Tonn
  • Publication number: 20120278346
    Abstract: A system for frequent pattern mining uses two layers of processing: a plurality of computing nodes, and a plurality of processors within each computing node. Within each computing node, the data set against which the frequent pattern mining is to be performed is stored in shared memory, accessible concurrently by each of the processors. The search space is partitioned among the computing nodes, and sub-partitioned among the processors of each computing node. If a processor completes its sub-partition, it requests another sub-partition. The partitioning and sub-partitioning may be performed dynamically, and adjusted in real time.
    Type: Application
    Filed: April 27, 2011
    Publication date: November 1, 2012
    Applicant: Microsoft Corporation
    Inventors: Shi Han, Yingnong Dang, Song Ge, Dongmei Zhang
  • Publication number: 20120254210
    Abstract: Systems and methods are described for efficiently processing, searching and/or rewriting variable width encoded data, such as UTF-8 encoded data. Embodiments of the systems and methods modify and adapt search algorithms, such as the Horspool and Wu-Manber algorithms, to efficiently process and manage searching of variable width encoded text in large blocks of text, such as text that may be carried via a stream of packets through a network device, such as an intermediary device.
    Type: Application
    Filed: March 27, 2012
    Publication date: October 4, 2012
    Inventors: Siva Kiran Dhulipala, Steven Craig Vernon
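For orientation, the unmodified Horspool algorithm applied to raw UTF-8 bytes looks like the sketch below. This is the textbook algorithm, not the adaptation the abstract refers to, whose details are not given. It relies on the fact that UTF-8 is self-synchronizing, so a byte-level match of a validly encoded pattern in validly encoded text always starts on a character boundary.

```python
from typing import List


def horspool(text: bytes, pattern: bytes) -> List[int]:
    """Return the byte offsets of every occurrence of pattern in text."""
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []
    # Shift for each byte value: distance from its rightmost occurrence
    # (excluding the last pattern byte) to the end of the pattern.
    shift = {byte: m - 1 - i for i, byte in enumerate(pattern[:-1])}
    hits = []
    pos = 0
    while pos <= n - m:
        if text[pos:pos + m] == pattern:
            hits.append(pos)
        # Shift by the table entry for the byte under the window's last
        # position; bytes absent from the pattern allow a full-length shift.
        pos += shift.get(text[pos + m - 1], m)
    return hits
```

Offsets are in bytes; a character index can be recovered with `len(text[:offset].decode("utf-8"))`.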
  • Publication number: 20120233185
    Abstract: String matching is a ubiquitous problem that arises in a wide range of applications in computer science, e.g., packet routing, intrusion detection, web querying, and genome analysis. Due to its importance, dozens of algorithms and several data structures have been developed over the years. A recent breakthrough in this field is the FM-index, a data structure that synergistically combines the Burrows-Wheeler transform and the suffix array. In software, the FM-index allows searching (exact and approximate) in times comparable to the fastest known indices for large texts (suffix trees and suffix arrays), but has the additional advantage of being much more space-efficient than those indices. This disclosure discusses an FPGA-based hardware implementation of the FM-index for exact and approximate pattern matching.
    Type: Application
    Filed: March 8, 2012
    Publication date: September 13, 2012
    Inventors: Walid A. Najjar, Edward Bryann C. Fernandez, Stefano Lonardi
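Exact matching with an FM-index uses backward search over the Burrows-Wheeler transform. The software sketch below is a small, unoptimized illustration of that idea and says nothing about the FPGA design in the disclosure. It builds the suffix array by naive sorting and assumes the input contains only characters that sort after the `$` sentinel (for example, letters); the function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

FMIndex = Tuple[Dict[str, int], Dict[str, List[int]], int]


def build_fm_index(text: str) -> FMIndex:
    """Build the C table and occurrence table over the BWT of text + '$'."""
    text += "$"  # sentinel: assumed absent from text and smaller than all of it
    suffix_array = sorted(range(len(text)), key=lambda i: text[i:])
    bwt = "".join(text[i - 1] for i in suffix_array)
    counts = Counter(text)
    # C[c]: number of characters in the text strictly smaller than c.
    C, total = {}, 0
    for c in sorted(counts):
        C[c] = total
        total += counts[c]
    # occ[c][i]: number of occurrences of c in bwt[:i].
    occ = {c: [0] * (len(bwt) + 1) for c in counts}
    for i, ch in enumerate(bwt):
        for c in occ:
            occ[c][i + 1] = occ[c][i] + (ch == c)
    return C, occ, len(bwt)


def count_occurrences(pattern: str, index: FMIndex) -> int:
    """Backward search: narrow the suffix-array interval one pattern
    character at a time, starting from the last character."""
    C, occ, n = index
    lo, hi = 0, n
    for c in reversed(pattern):
        if c not in C:
            return 0
        lo = C[c] + occ[c][lo]
        hi = C[c] + occ[c][hi]
        if lo >= hi:
            return 0
    return hi - lo
```

A practical index would sample the occurrence table and compress the transform to obtain the space savings the abstract mentions; the full tables here are kept only for readability.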
  • Publication number: 20120197920
    Abstract: Various embodiments of the present invention provide systems and methods for data processing. For example, some embodiments of the present invention provide data processing circuits including a pattern detection circuit having at least two data detector circuits each operable to receive the same series of data samples and to provide a first detected data output and a second detected data output, respectively. In addition, the data pattern detection circuit includes a result combining circuit that is operable to assert a pattern found output based at least in part on the first detected data output and the second detected data output.
    Type: Application
    Filed: January 27, 2011
    Publication date: August 2, 2012
    Inventor: Viswanath Annampedu
  • Publication number: 20120185494
    Abstract: The present disclosure relates to identifying consuming entity behavior across domains. Identity records are stored in a memory accessible to a computing system. Each of the identity records comprises at least one attribute, and the identity records originate from a plurality of domains. A determination is made as to whether a first one of the identity records identifies a consumption entity that is identified by a second one of the identity records. The first and the second identity records originate from separate ones of the domains, and the second one of the identity records is associated with a persistent key. The persistent key is associated with the consumption entity. The first identity record is associated with the persistent key if the first identity record is determined to identify the consumption entity.
    Type: Application
    Filed: January 19, 2011
    Publication date: July 19, 2012
    Applicant: VUELOGIC, LLC
    Inventors: Ronnie J. Garmon, Jim J. Deffenbaugh
  • Publication number: 20120179702
    Abstract: Proposed is a method for naturally and efficiently setting metadata in the daily process of searching for files. On a file search screen, there is provided a function of determining the candidate metadata of a metadata-nonregistered file, and initiating entry of metadata with the candidate metadata having been set. Determination of the candidate metadata is performed with any of the following three methods: a method of designating as a candidate a character string of a matched search keyword described as a regular expression, a method of designating as a candidate a file path or a character string in a file that matches a keyword dictionary, and a method of designating as a candidate metadata that frequently appears in metadata-registered files.
    Type: Application
    Filed: September 30, 2010
    Publication date: July 12, 2012
    Applicant: HITACHI SOLUTIONS, LTD.
    Inventors: Yasuyuki Nozaki, Toshiko Matsumoto, Matsuharu Oba
  • Publication number: 20120179724
    Abstract: Database elements are inserted into a database object by processing each of a plurality of operations in a sequential order within a first processing round to insert the database elements into the database objects, where processing for at least one operation in the order becomes suspended due to a resource request, and where at least one successive operation is initiated in response to suspension of one or more prior operations to enable prefetching of information for processing the operations. Each suspended operation is re-processed with the prefetched information in one or more additional processing rounds until processing of the operations is completed.
    Type: Application
    Filed: March 16, 2012
    Publication date: July 12, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Robert W. Lyle, Ping Wang
  • Publication number: 20120179718
    Abstract: There is provided a technique for automatically acquiring metadata with respect to various organizations which significantly reduces the man-hours required to prepare models for metadata extraction. With a pair comprising a document and metadata appearing therein as input, using a layout feature, and proximate text string and partial text string features with respect to metadata and a text string that is not metadata, the use of the layout feature, the proximate text string and the partial text string with respect to the automatic acquisition of metadata is automatically configured (see FIG. 1).
    Type: Application
    Filed: July 23, 2010
    Publication date: July 12, 2012
    Applicant: HITACHI SOLUTIONS, LTD.
    Inventor: Toshiko Matsumoto
  • Publication number: 20120179699
    Abstract: A data structure comprises a clump header table and an inline tree data structure. The inline tree, representing filterable data fields of hierarchically organized data records, comprises an alternating sequence of first-level binary string segments, each followed by one or more corresponding second-level binary string segments. Each clump header record includes an indicator of a location in the inline tree of corresponding binary string segments. A dedicated, specifically adapted conversion program generates the clump header file and the inline tree for storage on any computer-readable medium, and the inline tree can be read entirely into RAM to be searched or filtered. A dedicated, specifically adapted search and filter program is employed to list or enumerate retrieved data records. Run-time computer code generation can reduce time required for searching and filtering. One example includes spatial searching and filtering of data records that include spatial coordinates as data fields.
    Type: Application
    Filed: January 10, 2012
    Publication date: July 12, 2012
    Inventors: Roy W. Ward, David S. Alavi
  • Publication number: 20120173574
    Abstract: An information retrieving apparatus is provided which has excellent usability using voice inputs. Search queries input by a user via a touch panel or the like are used as voice recognition vocabulary. Further, the search queries are edited by use of morpheme information and/or other information in a database, thereby providing such voice recognition vocabulary that can be easily uttered by the user.
    Type: Application
    Filed: September 9, 2010
    Publication date: July 5, 2012
    Applicant: Clarion Co., Ltd.
    Inventors: Takeshi Homma, Hiroaki Kokubo, Yoshiyuki Mizuno
  • Publication number: 20120166440
    Abstract: A system and method for searching a document for a query pattern. A plurality of streams may be stored each including a linear sequence of nodes. Each stream may be associated with nodes having a common label in a data tree of the document. A query pattern may be searched for in the streams by executing a plurality of threads. Each of two or more of the threads may be used to search different sub-streams of the plurality of streams. Each of the different sub-streams searched for by each thread in each stream may be uniquely correlated with one or more disjoint sub-trees of a partition of the tree into a plurality of sub-trees. The two or more of the plurality of threads may be executed in parallel. A result of the query pattern search may be generated using at least one of the threads.
    Type: Application
    Filed: January 28, 2011
    Publication date: June 28, 2012
    Inventors: Oded SHMUELI, Lila Shnaiderman
  • Publication number: 20120158758
    Abstract: A comparison device for comparing test pattern files of a wafer tester includes a storage unit and a processing unit. The comparison device stores a first to-be-compared file and a second to-be-compared file into the storage unit. The first to-be-compared file and the second to-be-compared file are both text files. The processing unit reads the first to-be-compared file and the second to-be-compared file from the storage unit and executes a comparison operation to generate a comparison result. The comparison operation compares the words in a first section of the first to-be-compared file with the words in a second section of the second to-be-compared file in a one-to-one manner.
    Type: Application
    Filed: February 18, 2011
    Publication date: June 21, 2012
    Applicant: King Yuan Electronics Co., Ltd.
    Inventor: Fu-Tai CHEN
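The one-to-one word comparison described in this abstract could look roughly like the sketch below. The abstract does not say how sections are delimited, so the `(start, end)` line ranges and the function name `compare_sections` are assumptions made for illustration.

```python
from itertools import zip_longest


def compare_sections(lines_a, lines_b, section_a, section_b):
    """Compare the words of two line ranges pairwise and list the differences.

    section_a and section_b are hypothetical (start, end) line index ranges.
    Returns (word_index, word_from_a, word_from_b) for every mismatch; a
    missing word on either side shows up as None.
    """
    words_a = " ".join(lines_a[slice(*section_a)]).split()
    words_b = " ".join(lines_b[slice(*section_b)]).split()
    return [
        (i, wa, wb)
        for i, (wa, wb) in enumerate(zip_longest(words_a, words_b))
        if wa != wb
    ]


file_a = ["HEADER v1", "0101 1100 XXXX", "END"]
file_b = ["HEADER v2", "0101 1000 XXXX", "END"]
differences = compare_sections(file_a, file_b, (1, 2), (1, 2))
```

Restricting the comparison to chosen sections lets header lines that are expected to differ (here, the version string) be ignored.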
  • Publication number: 20120158768
    Abstract: The present invention extends to methods, systems, and computer program products for decomposing and merging regular expressions. Embodiments of the invention decompose a regular expression into multiple simple keyword graphs, merge those keyword graphs in a compact and efficient manner, and produce a directed acyclic graph (DAG) that can execute a simplified regular expression alphabet. Several of these regular expression DAGs can then be merged together to produce a single DAG that represents an entire collection of regular expressions. DAGs along with other text processing algorithms and a heap collection can be combined in a multi-pass approach to expand the regular expression alphabet.
    Type: Application
    Filed: December 15, 2010
    Publication date: June 21, 2012
    Applicant: Microsoft Corporation
    Inventors: Charles William Lamanna, Mauktik H. Gandhi, Jason Eric Brewer
  • Publication number: 20120143884
    Abstract: An apparatus for detecting presence of a selected complex string in a data stream has a primary processor, detecting coherent strings of the selected complex string; and a secondary processor, pipelined to the primary processor, and detecting ambiguous strings interleaving said coherent strings. The primary processor is configured to determine a number of coherent strings in the selected complex string, each coherent string preceded by a prefix of indefinite characters, with a last coherent string succeeded by a suffix of indefinite characters; and to locate sequential portions of the data stream that are identical to successive coherent strings of the selected complex string. The secondary processor is configured to ascertain congruence of portions of the data stream preceding and interleaving the sequential portions to prefixes of the coherent strings; and to ascertain congruence of the suffix to a corresponding portion of said data stream following the sequential portions.
    Type: Application
    Filed: February 12, 2012
    Publication date: June 7, 2012
    Inventor: Kevin Gerard BOYCE
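The two-stage idea in this abstract (match the fixed "coherent" substrings first, then check the "indefinite" characters between them) can be sketched in software as follows. This is a sequential approximation, not the pipelined two-processor apparatus; the wildcard syntax (`?` for any character, `#` for a digit) and all names are assumptions of this example.

```python
# Hypothetical classes for indefinite characters.
CLASSES = {"?": lambda c: True, "#": str.isdigit}


def split_pattern(pattern):
    """Split a pattern into coherent strings and indefinite positions.

    Returns two lists of (offset, value) pairs: literal runs, and single
    indefinite characters identified by their class symbol.
    """
    literals, indefinite = [], []
    i = 0
    while i < len(pattern):
        if pattern[i] in CLASSES:
            indefinite.append((i, pattern[i]))
            i += 1
        else:
            j = i
            while j < len(pattern) and pattern[j] not in CLASSES:
                j += 1
            literals.append((i, pattern[i:j]))
            i = j
    return literals, indefinite


def find_complex(data, pattern):
    """Return every start offset in data where the complex pattern matches."""
    literals, indefinite = split_pattern(pattern)
    matches = []
    for start in range(len(data) - len(pattern) + 1):
        # Stage 1: every coherent string must appear verbatim at its offset.
        if not all(data.startswith(lit, start + off) for off, lit in literals):
            continue
        # Stage 2: each indefinite character must satisfy its class.
        if all(CLASSES[cls](data[start + off]) for off, cls in indefinite):
            matches.append(start)
    return matches
```

For instance, `find_complex("id=AB12cd; id=ABxxcd", "AB##cd")` accepts only the first occurrence, because the second fails the digit check in stage 2 even though its coherent strings match.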
  • Publication number: 20120143896
    Abstract: The present disclosure is directed to systems and methods including receiving a source code at a parser, the source code comprising a statement, the statement comprising a dynamic character string including one or more relation names, each of the one or more relation names defining a relation between business objects, processing the statement using one or more processors to access at least one business object, the at least one business object being stored in machine-readable memory, the at least one business object corresponding to at least one of the one or more relation names, and having business object data associated therewith, and generating an output based on the business object data.
    Type: Application
    Filed: December 2, 2010
    Publication date: June 7, 2012
    Applicant: SAP AG, A GERMAN CORPORATION
    Inventor: Juergen GATTER
  • Publication number: 20120136890
    Abstract: A behavior pattern recognition method, system and a computer application program thereof are presented. The method is applicable to an electronic device which has a storage unit for storing multiple sets of behavior record information, and the method includes the following steps. First, a first detecting unit acquires first behavior feature information, and a collaboration network module acquires at least one second detecting unit having coherence with the first detecting unit. Then, the at least one second detecting unit acquires at least one second behavior feature information, and a processing unit compares the at least one second behavior feature information and the behavior record information to generate at least one comparison result. Finally, a behavior definition represented by the first behavior feature information is determined according to the comparison result.
    Type: Application
    Filed: December 15, 2010
    Publication date: May 31, 2012
    Inventors: Yung-Chuan WEN, Min-Siong Liang
  • Publication number: 20120136886
    Abstract: A method of incrementally refining queries and updating query result lists without requiring a user to provide an explicit indicator of query submission. In the method, one or more query related character patterns are defined. Entry of query defining characters by a user is monitored to detect entry of a defined query related character pattern. The user is provided with one or more suggested query refinement options and an updated query result list each time a defined query related character pattern is detected. A query related character pattern may be a character pattern that indicates a word has been input by the user.
    Type: Application
    Filed: February 8, 2012
    Publication date: May 31, 2012
    Applicant: Microsoft Corporation
    Inventors: Brett D. Brewer, Oliver Hurst-Hiller
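A minimal sketch of the trigger described in this abstract: the query is re-run whenever the typed text ends in a defined character pattern, with no explicit submit. The specific pattern (a word character followed by whitespace) and the names `should_refresh` and `simulate_typing` are assumptions chosen for illustration.

```python
import re

# Hypothetical "word completed" pattern: a word character followed by one
# whitespace character at the very end of the input.
WORD_COMPLETED = re.compile(r"\w\s\Z")


def should_refresh(query_so_far: str) -> bool:
    """True when the text typed so far ends with the defined pattern."""
    return WORD_COMPLETED.search(query_so_far) is not None


def simulate_typing(keystrokes: str, search):
    """Feed characters one at a time and call search() when the pattern fires.

    Returns a list of (query, search_result) pairs, one per triggered search.
    """
    buffer = ""
    refreshes = []
    for ch in keystrokes:
        buffer += ch
        if should_refresh(buffer):
            query = buffer.strip()
            refreshes.append((query, search(query)))
    return refreshes
```

With this pattern, typing `"red shoes "` triggers two searches, one for `red` and one for `red shoes`, while the intermediate keystrokes trigger none.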
  • Patent number: 8189931
    Abstract: A method and system are disclosed for matching input character sequences in a set of input patterns. The method comprises the steps of analyzing the set of input patterns, creating a pattern cluster look-up table (PCLT) based on said input patterns, and defining an offset value k. The PCLT is used to find, for each sequence s and offset k, a set of candidate patterns that can possibly match s; the set of candidate patterns is searched for patterns that match s; and all found matching patterns and sequences are reported.
    Type: Grant
    Filed: January 4, 2008
    Date of Patent: May 29, 2012
    Assignee: International Business Machines Corporation
    Inventors: Tien Huynh, Stanislav Polonsky, Isidore Rigoutsos
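The abstract does not specify how the PCLT is organized, so the following is only one speculative reading: patterns are clustered by the character they require at offset k, the table yields candidates for a sequence, and each candidate is then verified in full. The `.` wildcard, the equal-length matching rule, and all function names are assumptions of this sketch, not details from the patent.

```python
from collections import defaultdict


def build_pclt(patterns, k):
    """Cluster patterns by their character at offset k ('.' is a wildcard).

    Patterns shorter than k + 1 characters are left out of this toy table.
    """
    table = defaultdict(list)
    for p in patterns:
        if k < len(p):
            table[p[k]].append(p)
    return table


def pattern_matches(pattern, seq):
    """Full verification: same length, and each position equal or wildcard."""
    return len(pattern) == len(seq) and all(
        pc in (".", sc) for pc, sc in zip(pattern, seq)
    )


def find_matches(table, k, seq):
    """Look up candidates for seq via the table, then verify each one."""
    if k >= len(seq):
        return []
    candidates = []
    # Only clusters keyed by seq[k] or by the wildcard can possibly match.
    for key in sorted({seq[k], "."}):
        candidates.extend(table.get(key, []))
    return [p for p in candidates if pattern_matches(p, seq)]


patterns = ["A.GT", "ACGT", "TTGA", ".CGA"]
table = build_pclt(patterns, k=1)
```

The benefit of such a filter is that most patterns are rejected by a single table lookup, and the costlier position-by-position check runs only on the surviving candidates.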
  • Publication number: 20120124103
    Abstract: System, method and storage medium loaded with a program are provided for preparing tests in which a certain set of questions may be arranged in a different order amongst different copies of the test. There is further provided a method for setting a different order of answers of a multiple-choice type of test for each question, amongst different copies of the test. Further provided is a method and tool for keeping the decoding key for deciphering and reconstructing the order of the questions and answers within the test itself, thus allowing checking the answers and providing statistics on virtually any computer, regardless of the computer on which the test was composed. There is further provided a system and method for processing multiple types of inputs for the questions, such as text, graphics, images, equations and the like.
    Type: Application
    Filed: August 28, 2010
    Publication date: May 17, 2012
    Inventor: Ofir Epstein