Indexing, e.g., of XML Tags, etc. (EPO) Patents (Class 707/E17.123)
  • Patent number: 10303672
    Abstract: A system includes circuitry configured to: read a plurality of character information and a plurality of identifiers that are included in a text file; determine whether a character information among the plurality of character information is included between the at least one pair of identifiers among the plurality of identifiers in the text file; and associate the character information with the at least one pair of identifiers when it is determined that the character information is included between the at least one pair of identifiers.
    Type: Grant
    Filed: January 9, 2017
    Date of Patent: May 28, 2019
    Assignee: FUJITSU LIMITED
    Inventors: Masahiro Kataoka, Takahiro Murata, Naoki Akiyama
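
The association step in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes the "identifiers" are simple XML-style open/close tags without attributes, and the function name `associate_text_with_tags` is hypothetical.

```python
import re

def associate_text_with_tags(text):
    """Return (tag_name, enclosed_text) pairs for each matched <tag>...</tag> pair.

    Assumption: identifiers are attribute-free tags such as <title> and </title>.
    """
    pairs = []
    stack = []  # open tags awaiting a close: (name, offset just after the open tag)
    for m in re.finditer(r"<(/?)([A-Za-z_][\w.-]*)>", text):
        closing, name = m.group(1), m.group(2)
        if not closing:
            stack.append((name, m.end()))
        elif stack and stack[-1][0] == name:
            _, start = stack.pop()
            enclosed = text[start:m.start()]
            if enclosed.strip():
                # The character information lies between the pair, so associate it.
                pairs.append((name, enclosed))
    return pairs
```

For nested tags, the text associated with an outer pair in this sketch includes the inner markup; a fuller version would likely track character data separately.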
  • Patent number: 9576008
    Abstract: A system includes circuitry configured to: read a plurality of character information and a plurality of identifiers that are included in a text file; determine whether a character information among the plurality of character information is included between the at least one pair of identifiers among the plurality of identifiers in the text file; and associate the character information with the at least one pair of identifiers when it is determined that the character information is included between the at least one pair of identifiers.
    Type: Grant
    Filed: April 18, 2014
    Date of Patent: February 21, 2017
    Assignee: FUJITSU LIMITED
    Inventors: Masahiro Kataoka, Takahiro Murata, Naoki Akiyama
  • Patent number: 9424240
    Abstract: Embodiments are described for capturing, displaying, and navigating annotations in an electronic document. In some embodiments, a selection of an object in the electronic document is received, and an annotation associated with the selected object is received. The annotation is maintained in a list of annotations. The annotation can also be displayed. In some embodiments, a selection of the annotation is received and in response to the selection of the annotation being received, a user may be navigated to a location of the selected object in the electronic document.
    Type: Grant
    Filed: October 7, 2013
    Date of Patent: August 23, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: John L. Beezer, David M. Silver, Marco M. DeMello
  • Publication number: 20140115012
    Abstract: A method, system, and computer program product for database schema management. The computer implemented method for data model optimization using multilevel entity dependency analytics commences by accessing a multilevel schema data structure, determining the relationship lineages present in the multilevel schema data structure and generating a dependency table using the relationship lineage. Then, using the dependency table, the computer implemented method performs at least one of a high impact analysis, a referential integrity analysis, or a conformance analysis. In some embodiments the results of the analysis are reported to a user and in some embodiments the results of the analysis are applied to the multilevel schema data structure.
    Type: Application
    Filed: October 23, 2012
    Publication date: April 24, 2014
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventor: Pradipta DAS
  • Publication number: 20140046949
    Abstract: New text associated with an updated document is compared with existing embedded index items within the updated document and with existing contextual text within the updated document associated with the existing embedded index items. A determination is made as to whether any of the new text includes candidate text for at least one new index item that is not already indexed within the existing embedded index items using a contextual index compendium including contextual index item generation rules and the existing contextual text within the updated document associated with the existing embedded index items. At least one new index item is generated, using the contextual index item generation rules, in response to determining that at least one portion of the new text includes the candidate text that is not already indexed within the existing embedded index items.
    Type: Application
    Filed: August 7, 2012
    Publication date: February 13, 2014
    Applicant: International Business Machines Corporation
    Inventor: Neil G. Bloomfield
  • Patent number: 8626951
    Abstract: An interoperability system abstracts the protocols used by multiple network applications into an interoperability framework, thereby allowing the network applications to interoperate with each other and/or with modules for providing enhanced functionalities. The interoperability framework includes a number of adapters that modularize the components needed for interoperation and abstract content from the underlying protocols and procedures used by the network applications. The interoperability framework includes a resource control that enables the network applications to access shared resources, such as data and content used by the network applications, thus allowing the applications to interoperate and for functionality enhancements to be added.
    Type: Grant
    Filed: April 23, 2007
    Date of Patent: January 7, 2014
    Assignee: 4DK Technologies, Inc.
    Inventors: Tamara L. Casey, David P. Martin
  • Publication number: 20130218915
    Abstract: Software validation is provided for a breakout system having multiple subsystems at the edge of a mobile data network. The software validation utilizes one or more trusted platform modules (TPM) to secure multiple subsystems including virtual machines in the breakout system. Hash values for the software in the various subsystems are placed in Platform Configuration Registers (PCRs) of the TPM. The TPM cryptographically signs quotes, which are a collection of hash values from the PCRs. The breakout system produces an extensible markup language (XML) file with the signed quotes related to the subsystems and sends them to a network management system for verification. The network management system validates the software configured on the breakout system using a public key to access the quotes and compares the values to known good values stored in an inventory record associated with the specific breakout system being validated.
    Type: Application
    Filed: February 22, 2012
    Publication date: August 22, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ronald L. Billau, Vincenzo V. Di Luoffo, Terence K. Hennessy
  • Publication number: 20120254262
    Abstract: There is disclosed a method, apparatus and computer program for parsing a message using a message model. A message is received comprising one or more message fields. This message is stored as a reference bitstream. The message model is used to compare a message field in one or more subsequently received messages with the equivalent field in the reference bitstream. Finally, responsive to determining that a message field in said one or more subsequently received messages matches a field in the reference bitstream a predetermined number of times, parser outputs for the matching field are stored for future reuse.
    Type: Application
    Filed: June 18, 2012
    Publication date: October 4, 2012
    Applicant: International Business Machines Corporation
    Inventor: Timothy Kimber
  • Publication number: 20120131067
    Abstract: Systems, methods and computer readable medium for migrating Information Management System (IMS) Full-Function databases to IMS High Availability Large Databases (HALDBs) are described. Full Function database conversion operations in accordance with this disclosure assign a unique identifier to each segment having a physically paired logically related child segment. These unique identifiers may then be used during HALDB load operations to identify a segment's physically paired logically related segment. Use of the disclosed unique identifiers permit Full Function database conversion operations to avoid the input-output (I/O) and compare operations needed by prior art unload techniques to completely identify physically paired logically related segments.
    Type: Application
    Filed: November 24, 2010
    Publication date: May 24, 2012
    Applicant: BMC SOFTWARE, INC.
    Inventor: David Albert Smith
  • Publication number: 20120124005
    Abstract: A method of recording user activity of a user on a comprising the steps of providing a file to a user from a network; transmitting and storing the file to a file archive; gathering user activity data for each file accessed; associating user activity data with each file accessed; and transmitting and storing user activity data; categorizing user activity data; ranking user activity data; compressing/decompressing user activity data; removing redundant data from file archive; collecting file enhancement data, associating the file enhancement data with a file being currently accessed by the user, and transmitting the file enhancement data; storing the actual content accessed by the user and the file enhancement data as stored material; indexing the stored material; receiving a command, including input parameters, for conducting a search of the stored material, the input parameters including keyword and file enhancement data from the user; generating a customizable user activity report.
    Type: Application
    Filed: December 20, 2011
    Publication date: May 17, 2012
    Inventors: George Eagan, Prabhdeep Singh
  • Publication number: 20110304774
    Abstract: Embodiments are disclosed that relate to the automatic tagging of recorded content. For example, one disclosed embodiment provides a computing device comprising a processor and memory having instructions executable by the processor to receive input data comprising one or more of a depth data, video data, and directional audio data, identify a content-based input signal in the input data, and apply one or more filters to the input signal to determine whether the input signal comprises a recognized input. Further, if the input signal comprises a recognized input, then the instructions are executable to tag the input data with the contextual tag associated with the recognized input and record the contextual tag with the input data.
    Type: Application
    Filed: June 11, 2010
    Publication date: December 15, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Stephen Latta, Christopher Vuchetich, Matthew Eric Haigh, JR., Andrew Robert Campbell, Darren Bennett, Relja Markovic, Oscar Omar Garza Santos, Kevin Geisner, Kudo Tsunoda
  • Patent number: 8051372
    Abstract: A system and method for automatically detecting and extracting semantically significant text from an HTML document associated with a plurality of HTML documents is disclosed. The method may include receiving an HTML document, parsing the HTML document into a parse tree, segmenting the parse tree into one or more segments of one or more unique paths, processing the one or more segments based at least on the HTML document, and extracting one or more processed segments from at least the HTML document based on a predetermined number.
    Type: Grant
    Filed: April 12, 2007
    Date of Patent: November 1, 2011
    Assignee: The New York Times Company
    Inventor: Evan Stapleton Sandhaus
  • Publication number: 20110252039
    Abstract: A system, apparatus and method for processing content on a computing device is provided whereby, in one embodiment, a domain object model and associated style sheet are received at a computing device. The computing device is configured to parse the domain object model and determine styles from the style sheet and to store those styles in an interim storage for later use.
    Type: Application
    Filed: October 14, 2010
    Publication date: October 13, 2011
    Applicant: RESEARCH IN MOTION LIMITED
    Inventor: Shannon Ralph Normand White
  • Publication number: 20110246474
    Abstract: It is an object of the present invention to provide a data management apparatus capable of efficiently managing various data that is transmitted and received via a network, by automatically and adequately generating an ID with which the identity of pieces of data can be verified and associating the ID with the pieces of data, even if specifying information such as a marking assigned to the piece of data is modified by a completely independent server.
    Type: Application
    Filed: August 5, 2009
    Publication date: October 6, 2011
    Inventors: Koichi Abe, Shogo Tsubouchi, Shinichi Yasuda
  • Patent number: 7991762
    Abstract: Crawling pages is disclosed. Pages are crawled up to a target number of pages. Additional pages, that have an importance that is equal to or greater than an importance threshold, are crawled beyond the target number of pages. In some embodiments, pages having an importance less than an importance threshold are deleted.
    Type: Grant
    Filed: March 24, 2009
    Date of Patent: August 2, 2011
    Assignee: Google Inc.
    Inventors: Jesse L. Alpert, Praveen K. Tammana, Yair Kurzion
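
The crawl policy in the abstract above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: pages arrive as `(url, importance)` tuples in crawl order, importance is a precomputed number, and `select_pages_to_crawl` is a hypothetical name, not anything from the patent.

```python
def select_pages_to_crawl(pages, target_count, importance_threshold):
    """Return the URLs that would be crawled, in order.

    pages: iterable of (url, importance) tuples in crawl order (assumed input shape).
    """
    crawled = []
    for url, importance in pages:
        if len(crawled) < target_count:
            # Below the target: crawl regardless of importance.
            crawled.append(url)
        elif importance >= importance_threshold:
            # Beyond the target: crawl only sufficiently important pages.
            crawled.append(url)
    return crawled
```

With a target of 2 and a threshold of 0.5, the input `[("a", 0.1), ("b", 0.9), ("c", 0.2), ("d", 0.8)]` yields `["a", "b", "d"]`: the first two pages fill the target, and only `d` is important enough to be crawled beyond it.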
  • Publication number: 20110106811
    Abstract: A method and apparatus are provided for building and using a persistent XML tree index for navigating an XML document. The XML tree index is stored separately from the XML document content, and thus is able to optimize performance through the use of fixed-sized index entries. The XML document hierarchy need not be constructed in volatile memory, so creating and using the XML tree index scales even for large documents. To evaluate a path expression including descendent or ancestral syntax, navigation links can be read from persistent storage and used directly to find the nodes specified in the path expression. The use of an abstract navigational interface allows applications to be written that are independent of the storage implementation of the index and the content. Thus, the XML tree index can index documents stored at least in a database, a persistent file system, or as a sequence of in memory.
    Type: Application
    Filed: October 30, 2009
    Publication date: May 5, 2011
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Anguel Novoselsky, Zhen Hua Liu, Thomas Baby
  • Publication number: 20110072004
    Abstract: A system, method and program product for processing an inputted XPath query against an XML document. A method is disclosed that includes: generating a path index and an MTree structure index from the XML document using a computing device, wherein the MTree structure index has at least one qpath; executing a query against the path index to generate an initial sequence containing a node for each qpath in the XML document that satisfies the query; generating a hash map from the initial sequence from an MTree structure index containing path ids that are located along qpaths in a second MTree structure index; and testing the path id of each node located along a qpath of the MTree structure index against the path id in the hash map to generate a result sequence.
    Type: Application
    Filed: September 24, 2009
    Publication date: March 24, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Primo M. Pettovello
  • Publication number: 20100305922
    Abstract: Embodiments relate to generating active links between model objects. A modeling client can host modeling logic and an application programming interface (API) to create, access, manipulate, and import/export modeling objects used in modeling applications, such as engineering, medical, financial, and other modeling platforms. The source data accepted into the modeling client can include consumer or business-level applications, whose spreadsheet, database or other content can be extracted and encapsulated in object-oriented format, such as extensible markup language (XML) format. Links can be inserted in the resulting model object to link to external resources, such as additional model objects, services, local or remote modeling tools, or other resources. The model object can share, exchange, or combine data from other model object(s), as well as instantiate functions hosted in other model object(s). Multiple links can be inserted to multiple model objects in linked list, node, or other configurations.
    Type: Application
    Filed: May 29, 2009
    Publication date: December 2, 2010
    Inventor: Eric WILLIAMSON
  • Patent number: 7739292
    Abstract: A distributed system includes client- and server-side components that, together, allow XML schemas or the like to be managed in a collaborative way across an enterprise environment. The server-side component, or server, is used to assemble a collection of one or more schemas. The server exposes the collection of schemas to one or more client components that connect to the server. Such connections may be made over a computer network. The client component preferably exports a display interface through which a user can visualize dependencies between schemas, e.g., by dragging them into a workspace and viewing all related schemas and how they are connected, as well as what elements and other components they define. In this way, the client component provides a high-level view of a larger (e.g., enterprise-wide) information architecture.
    Type: Grant
    Filed: September 28, 2005
    Date of Patent: June 15, 2010
    Assignee: Altova GmbH
    Inventors: Alexander Falk, Vladislav Gavrielov
  • Patent number: 7730162
    Abstract: A method of automatically generating a markup language message from an application specific interface object, as well as generating an application specific interface object from a markup language message is provided. The method can include receiving an application specific interface object and identifying an object table specified therein. Objects listed in the object table can be automatically identified. The objects can be serialized by representing the objects in the markup language message. The resulting markup language message can be a representation of the application specific interface object.
    Type: Grant
    Filed: September 17, 2007
    Date of Patent: June 1, 2010
    Assignee: International Business Machines Corporation
    Inventors: Peter J. Lynch, Brenda H. High
  • Publication number: 20100070541
    Abstract: The present disclosure provides a student information state reporting system. The disclosed system allows a user to define a form that includes questions for capturing data elements related to a state report. The user also associates the form with a snapshot for automatic synchronization of at least one of the data elements. In addition, the user defines at least one field in the form to be included in the snapshot and automatically synchronized. Next, the user associates the form with an output definition that matches a state report format. Preferably, an output based on the output definition for the state report is automatically scheduled, and the automatic synchronization is only applied when in an on-line mode.
    Type: Application
    Filed: September 3, 2009
    Publication date: March 18, 2010
    Applicant: METAPHOR SOFTWARE, INC.
    Inventor: Scott John Orr
  • Publication number: 20100030726
    Abstract: A mechanism is provided for improving the performance of particular database queries operating on a view comprising binary-encoded XML. The key to improving performance is to determine whether an XML index is available to improve the computation of XPath data before deriving an optimized execution plan for a particular query. When an XML index is not available to help expedite the computation of binary-encoded XML, then deferring the computation of the XPath data until after the view merge stage avoids unnecessary computation of XPaths that are part of the view but not needed for the particular query.
    Type: Application
    Filed: July 29, 2008
    Publication date: February 4, 2010
    Inventors: Sivasankaran Chandrasekar, Balasubramanyam Sthanikam, Sam Idicula, Nipun Agarwal
  • Publication number: 20090319295
    Abstract: A computer-implemented method for identifying and assessing public health events, and a corresponding system and apparatus, includes capturing public health-related information from structured and unstructured sources, where the information is contained in one or more documents, extracting meta-data from the captured public health-related information, creating an index of the extracted meta-data; archiving the meta-data and the documents, where the index links meta-data to its associated document, processing the extracted meta-data according to one or more detection algorithms to determine if an anomaly exists, and where an anomaly exists, providing a public health event notification, and monitoring and evaluating the responses to the public health events.
    Type: Application
    Filed: September 21, 2006
    Publication date: December 24, 2009
    Inventors: Taha A. Kass-Hout, Massimo Mirabito
  • Publication number: 20090259625
    Abstract: A method for tagging data content, the method comprising, receiving a tag associated with a data content having a data content identifier from a first user, associating the tag with a tag identifier, associating the tag with a user identifier of the first user, saving the tag identifier, the data content identifier, the tag, and a privacy designation associated with the tag identifier in a first table responsive to determining that the tag associated with the data content identifier is not present in the first table, and saving the user identifier and the tag identifier in a second table responsive to determining that the user identifier and the tag identifier are not present in the second table.
    Type: Application
    Filed: April 14, 2008
    Publication date: October 15, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Katsuhisa Kataoka
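
The two-table bookkeeping in the abstract above can be sketched as follows. This is a simplified illustration, not the claimed method: the tables are modeled as an in-memory dict and set rather than database tables, and the class name `TagStore`, its fields, and the default privacy value are all hypothetical.

```python
import itertools

class TagStore:
    """In-memory stand-in for the two tables described in the abstract."""

    def __init__(self):
        # First table: (content_id, tag) -> (tag_id, privacy designation).
        self.tags = {}
        # Second table: (user_id, tag_id) pairs.
        self.user_tags = set()
        self._next_id = itertools.count(1)

    def add_tag(self, user_id, content_id, tag, privacy="public"):
        key = (content_id, tag)
        # Save a new row only if this tag is not already present for the content.
        if key not in self.tags:
            self.tags[key] = (next(self._next_id), privacy)
        tag_id = self.tags[key][0]
        # A set ignores duplicates, so the pair is stored only if absent.
        self.user_tags.add((user_id, tag_id))
        return tag_id
```

When two users apply the same tag to the same content, the first table keeps one row and the second table gains one row per user, which is the deduplication the abstract describes.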
  • Publication number: 20090226046
    Abstract: A method of characterizing a program includes defining a scene as a portrayal of an emotion of a first character and identifying each scene within a first program to apportion the first program into a series of scenes. An emotional profile of the first program is built according to the series of scenes. Recommendation of a program includes correlating the emotional profile of the first program with a user preference profile.
    Type: Application
    Filed: October 8, 2008
    Publication date: September 10, 2009
    Inventor: Yevgeniy Eugene Shteyn
  • Publication number: 20090094190
    Abstract: Methods, systems and computer program products for displaying tag words for selection by users engaged in social tagging of content accessible via a communications network, are provided. A tag cloud, tag word inventory curve and slider control are displayed within a graphical user interface. The tag cloud is a visual representation of an inventory of tag words. The tag word inventory curve is a graphical representation of the words in the inventory by frequency of usage by others. The slider control is responsive to user movement, and is movable within a range that spans the length of the tag word inventory curve. Movement of the slider control changes the number of tag words from the inventory displayed in the tag cloud according to frequency of usage by others.
    Type: Application
    Filed: October 8, 2007
    Publication date: April 9, 2009
    Inventor: R. Todd Stephens
  • Publication number: 20090094189
    Abstract: Methods, systems and computer program products for managing tags added by users engaged in social tagging of content accessible via a communications network include identifying critical words associated with content accessed by a user, and recommending one or more content-descriptive tags to the user based on critical words identified in the content. Identifying critical words in content includes assigning a weighted value to content words, for example, based on occurrence and location of content words within the content. Identifying critical words in content also includes assigning a weighted value to content words, for example, based on the position on a content word inventory curve, such as a “long tail” curve. The position on a long tail curve defines popularity of content words in other social tags currently in use.
    Type: Application
    Filed: October 8, 2007
    Publication date: April 9, 2009
    Inventor: Robert Todd Stephens
  • Publication number: 20090083335
    Abstract: This invention relates to the development of a business program in an XML driven architecture. First, XML data corresponding to a form screen is analyzed and a business class corresponding to a tag included in the XML data is specified. Then, by referring to a business class manager in which the business classes to be created are registered, it is judged whether or not the specified business class has not been registered in the business class manager, and if the specified business class has not been registered in the business class manager, the specified business class is registered in the business class manager. After this, each of the business classes has to be specifically programmed. On the other hand, at the execution time of the programmed business classes, XML data including tags corresponding to data inputted or selected for the form screen is obtained, and the business classes that correspond to the tags included in the XML data and are programs to carry out a processing relating to the tags.
    Type: Application
    Filed: November 25, 2008
    Publication date: March 26, 2009
    Applicant: FUJITSU LIMITED
    Inventor: Nobuo Togahara
  • Publication number: 20090077107
    Abstract: Methods and systems for managing and analyzing multi-dimensional data are provided. Example embodiments provide a Meta-Object Data Management System “MODMS,” which enables users to arrange and to rearrange the hierarchical relationships of the data on an ad-hoc basis and allows the data to be analyzed using any set of attributes (dimensions) while the system is running. The MODMS represents heterogeneous data in a normalized (standardized) fashion using an object type management system that allows the arbitrary coercion of one type of object into another different type of object and automatically resolves attribute dependencies. In one embodiment, the MODMS comprises an object type management subsystem; a meta-object instantiation subsystem; one or more data repositories that hold, for example, the data used to populate objects and object type definitions; and an input/output interface.
    Type: Application
    Filed: November 25, 2008
    Publication date: March 19, 2009
    Inventors: John Scumniotales, Eric Burke, Robert Cahn, Thomas Fannon, Mitchel Burns, Michael Lester, Jeff Hill
  • Publication number: 20090063533
    Abstract: An apparatus and method are disclosed for finding and returning sub-trees from within a preselected XML document that match an XQuery FLWOR expression having a binding order, in which a match graph is generated from an XML index of node paths for a collection of XML documents, where the collection includes the preselected XML document and the match graph is first traversed by a plurality of cursors in a reverse binding order and traversed by the plurality of cursors in forward binding order.
    Type: Application
    Filed: August 27, 2007
    Publication date: March 5, 2009
    Applicant: International Business Machines Corporation
    Inventor: Edison Lao Ting
  • Publication number: 20080320030
    Abstract: The methods and systems described herein may involve providing a standard syntax for specifying an item of lifeotype information, wherein the syntax includes at least one markup language tag, the tag specifying a type of information included in the lifeotype information. The methods and systems described herein may involve providing an item of lifeotype information, associating at least one tag with the item of lifeotype information, and documenting the lifeotype information, association and the at least one tag in a standard syntax. The methods and systems described herein may involve providing an item of lifeotype information, defining standard fields to be populated in connection with the item of lifeotype information, populating the standard fields, and aggregating the standard fields and lifeotype information across at least two items of lifeotype information.
    Type: Application
    Filed: February 19, 2008
    Publication date: December 25, 2008
    Inventors: John M. Stivoric, Eric Teller, David Andre, John A. Monocello
  • Publication number: 20080281832
    Abstract: The invention provides a system and method for acquiring, normalizing, indexing and storing information related to RSS feeds and their constituent content items in a searchable database. In some embodiments, the invention includes an application that accesses electronic feed documents for a plurality of RSS feeds to ascertain information regarding the feed and content items within the feed. The information is then parsed to identify characteristics of the feeds and their content items. The characteristics of the feeds and content items can then be stored in a common format in a database. This enables, inter alia, creation of custom feeds by utilizing the normalized feed data.
    Type: Application
    Filed: May 8, 2007
    Publication date: November 13, 2008
    Inventors: Jeffrey L. Pulver, James R. Marcus, Suman Roy, Matthew Stokes, Dmitry V. Yashin
  • Publication number: 20080275850
    Abstract: An appropriate tag is assigned to an image in comparatively simple fashion. An image of interest to be tagged is selected and tags that have already been assigned to the selected image of interest are displayed in a present-tag display area. Tags having a high frequency of appearance are extracted from among tags that have been assigned to images having tags identical with the tags that have already been assigned to the image of interest, these images being taken from among images that have been stored in an image database. The extracted tags are displayed in a tag candidate display area as candidate tags. Since the tags displayed in the tag candidate display area often are tags related to the selected image of interest, they are tags suitable for assignment to the image of interest.
    Type: Application
    Filed: March 13, 2008
    Publication date: November 6, 2008
    Inventor: Arito ASAI
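
The candidate-tag extraction in the abstract above can be approximated with a short Python sketch. It assumes the image database is simply a list of tag lists and that "images having identical tags" means images sharing at least one tag with the image of interest; `candidate_tags` and its parameters are hypothetical names.

```python
from collections import Counter

def candidate_tags(image_tags, database, top_n=3):
    """Suggest tags that often co-occur with the image's existing tags.

    image_tags: tags already assigned to the image of interest.
    database:   list of tag lists, one per stored image (assumed shape).
    """
    current = set(image_tags)
    counts = Counter()
    for other_tags in database:
        other = set(other_tags)
        if current & other:
            # This stored image shares a tag; count its remaining tags.
            counts.update(other - current)
    # Highest-frequency tags first.
    return [tag for tag, _ in counts.most_common(top_n)]
```

The order of tags with equal counts is not specified in this sketch; a real system would need a tie-breaking rule.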
  • Publication number: 20080263085
    Abstract: Defining target data types within an instance of a target data type definition for inclusion in a reference data type definition to declare expected entity relationships in a model. The model corresponds to, for example, software, a business process, or an organizational structure. The target data type defines one or more elements with target data. The target data describes entities that are expected to be related to the reference entity according to a predefined entity relationship. The existing reference data type definition is used to declare the expected entity relationships. In an embodiment, the defined target data type includes a tag in an eXtensible Markup Language (XML) document.
    Type: Application
    Filed: April 20, 2007
    Publication date: October 23, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Andrea Rose Westerinen, Nigel Graham Cain
  • Publication number: 20080195668
    Abstract: A method and system are provided for extracting a valid, self-contained fragment for a node in an XML document stored in a database management system. An XML index is used to identify a location in which XML fragment data corresponding to the node is located. Ancestors of the node are identified and examined for any information needed for the proper interpretation of the fragment. If an ancestor node contains such needed information, this information is patched into the XML fragment to ensure that the fragment is a valid, self-contained XML fragment.
    Type: Application
    Filed: April 16, 2008
    Publication date: August 14, 2008
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Sivasankaran Chandrasekar, Ashish Thusoo, Ravi Murthy, Nipun Agarwal, Eric Sedlar, Sreedhar Mukkamalla
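
A minimal sketch of the "patching" idea in the abstract above, assuming that the needed ancestor information is namespace declarations. It walks an in-memory DOM with Python's standard `xml.dom.minidom` rather than using an XML index inside a database, and the function name `self_contained_fragment` is hypothetical.

```python
from xml.dom import minidom


def self_contained_fragment(node):
    """Serialize `node` together with namespace declarations inherited from its ancestors."""
    fragment = node.cloneNode(True)
    ancestor = node.parentNode
    while ancestor is not None and ancestor.nodeType == ancestor.ELEMENT_NODE:
        for name, value in ancestor.attributes.items():
            is_ns_decl = name == "xmlns" or name.startswith("xmlns:")
            # The nearest declaration wins, so never overwrite one already copied.
            if is_ns_decl and not fragment.hasAttribute(name):
                fragment.setAttribute(name, value)
        ancestor = ancestor.parentNode
    return fragment.toxml()


doc = minidom.parseString(
    '<root xmlns:a="urn:example:a"><mid><a:leaf>x</a:leaf></mid></root>'
)
leaf = doc.getElementsByTagName("a:leaf")[0]
fragment = self_contained_fragment(leaf)
# The patched fragment parses on its own; a bare <a:leaf> has an unbound prefix.
reparsed = minidom.parseString(fragment)
print(fragment)
```

Without the copied declaration, the extracted element would refer to a prefix that is only bound on an ancestor, which is why the fragment would not be valid in isolation.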
  • Publication number: 20080162538
    Abstract: A second sequence of visual information is produced, based on a first sequence of visual information. An XML document is generated, which identifies edits to be made to the first sequence of visual information for producing the second sequence of visual information. The XML document includes an element that corresponds to a cut list. The cut list element includes multiple item elements, which are subordinate to the cut list element within the XML document. The multiple item elements include at least one item element that represents an effect, at least one that represents a cut, and/or at least one that represents a gap. The XML document is used to determine how to edit the first sequence of visual information, to produce the second sequence of visual information.
    Type: Application
    Filed: December 29, 2006
    Publication date: July 3, 2008
    Inventor: Michael Loran Kary
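
To make the cut-list structure in the abstract above concrete, here is a small sketch. The element and attribute names (`cutlist`, `item`, `type`, `start`, `end`, `length`, `name`) are invented for illustration and are not taken from the application; frames are modelled as plain strings, and the "effect" is a placeholder that only marks the preceding frame.

```python
import xml.etree.ElementTree as ET


def build_cut_list(items):
    """Build a cut-list element whose children are the individual edit items."""
    root = ET.Element("cutlist")
    for kind, attrs in items:
        item = ET.SubElement(root, "item", {"type": kind})
        for key, value in attrs.items():
            item.set(key, str(value))
    return root


def apply_cut_list(root, frames):
    """Produce the second sequence from the first by following the items in order."""
    output = []
    for item in root.findall("item"):
        kind = item.get("type")
        if kind == "cut":
            output.extend(frames[int(item.get("start")):int(item.get("end"))])
        elif kind == "gap":
            output.extend(["<blank>"] * int(item.get("length")))
        elif kind == "effect" and output:
            # Placeholder effect: tag the previous frame instead of rendering anything.
            output[-1] = f"{output[-1]}+{item.get('name')}"
    return output


cut_list = build_cut_list([
    ("cut", {"start": 0, "end": 2}),
    ("gap", {"length": 1}),
    ("cut", {"start": 4, "end": 5}),
    ("effect", {"name": "fade"}),
])
frames = ["f0", "f1", "f2", "f3", "f4", "f5"]
print(ET.tostring(cut_list, encoding="unicode"))
print(apply_cut_list(cut_list, frames))
```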
  • Publication number: 20080098020
    Abstract: Techniques are provided for incrementally maintaining an XML index built to access XML data that is encoded in binary XML form. Rather than delete and reinsert index entries of all the nodes of a modified XML document, only the index entries of the affected nodes are modified. Consequently, the order key values stored in the index may become inconsistent with the current hierarchical locations of the nodes to which the order key values correspond. Techniques are described for resolving the inconsistencies, and for addressing additional problems that result when the XML index is path-subsetted.
    Type: Application
    Filed: March 7, 2007
    Publication date: April 24, 2008
    Inventors: Nitin Gupta, Sivasankaran Chandrasekar, Sam Idicula, Nipun Agarwal
  • Publication number: 20080077554
    Abstract: Methods and devices for optimizing the processing of documents of the XML type comprising components are described. After having identified the required XML document and the processing mode that is to be performed on this document, for example forward or backward, the client sends a request comprising this information. On receipt of this request, the server accesses the identified document and sequences it according to the processing mode that is to be performed, before transmitting it. If the processing mode is of the backward type, it is not necessary to process the entire document before transmitting it; it may be transmitted gradually as sequencing takes place. The algorithms for sequencing and processing the components of an XML document are adapted to process both simple components and components having circular references.
    Type: Application
    Filed: August 31, 2007
    Publication date: March 27, 2008
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Youenn Fablet
  • Patent number: 7340459
    Abstract: According to the present invention, apparatus and methods are provided to enable a user to locate and retrieve sets of information relevant to search criteria specified in a search query submitted by the user. Search results include not only a list of information sets matching the search criteria, but also the preserved structure of any tags used in annotating the information set according to a structured mark-up language such as XML. A user may select a tag from a presented list of the returned tag structures, and the apparatus lists those documents containing the selected tags. The list of tags is then adjusted to include the selected tag and any other of the returned tags contained in the listed documents. Further tag selection from the adjusted list leads to a further refinement of the listed documents, enabling the user to navigate the search results on the basis of tag information.
    Type: Grant
    Filed: September 24, 2001
    Date of Patent: March 4, 2008
    Assignee: British Telecommunications public limited company
    Inventors: Uwe Krohn, Nicholas J Davies, Richard Weeks
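
The tag-driven refinement loop in the abstract above might look like the following sketch. It assumes the search results are already available as a mapping from document id to the set of tags found in that document; the function name `refine` and the sample data are hypothetical.

```python
def refine(results, selected_tags):
    """Return the documents containing every selected tag, plus the adjusted tag list.

    `results` maps a document id to the set of tags found in that document.
    """
    selected = set(selected_tags)
    matching = {doc: tags for doc, tags in results.items() if selected <= tags}
    # The adjusted list holds every tag that occurs in the remaining documents.
    remaining_tags = set().union(*matching.values())
    return sorted(matching), sorted(remaining_tags)


results = {
    "d1": {"book", "author", "title"},
    "d2": {"book", "price"},
    "d3": {"article", "author"},
}
print(refine(results, ["book"]))
print(refine(results, ["book", "author"]))
```

Each further selection shrinks both the document list and the list of tags still on offer, which is what lets a user navigate the results by tag.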
  • Publication number: 20080010313
    Abstract: A method and a computer system for indexing and searching the data content of nested field records, such as those in Extensible Markup Language (XML). The system includes an indexing and searching engine that constructs an improved full-text search index on the input XML data and then performs searches using the index. The system supports exact matches and partial matches using a wildcard character. The method transforms the input XML data into a form that encodes the data structural information by suffixing each word with its corresponding field qualifiers or an equivalent numerical pattern thereof. The resulting encoded words are then stored in a full-text index structure. Various types of full-index search may be performed. One alternative embodiment is to combine string matching and numeric or integer pattern matching to identify a particular word in a particular field.
    Type: Application
    Filed: September 20, 2007
    Publication date: January 10, 2008
    Inventor: David THEDE
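
One way to picture the word-encoding step in the abstract above is the sketch below. It suffixes each word with the path of enclosing fields, stores the encoded words in a simple in-memory inverted index, and supports wildcard matching through Python's `fnmatch`. The `/` separator, the function names, and the sample records are assumptions; the application's numerical-pattern variant is not shown, and text that follows a child element (tail text) is ignored for brevity.

```python
import fnmatch
import xml.etree.ElementTree as ET
from collections import defaultdict


def encode(element, path=()):
    """Yield each word suffixed with the path of fields that encloses it."""
    path = path + (element.tag,)
    for word in (element.text or "").split():
        yield f"{word.lower()}/{'/'.join(path)}"
    for child in element:
        yield from encode(child, path)


def build_index(records):
    """Map every encoded word to the set of record ids that contain it."""
    index = defaultdict(set)
    for record_id, xml_text in records.items():
        for token in encode(ET.fromstring(xml_text)):
            index[token].add(record_id)
    return index


def search(index, pattern):
    """Exact match, or partial match when the pattern contains the `*` wildcard."""
    hits = set()
    for token, ids in index.items():
        if fnmatch.fnmatchcase(token, pattern):
            hits |= ids
    return sorted(hits)


records = {
    1: "<book><author>Smith</author><title>Smith on Trees</title></book>",
    2: "<book><author>Jones</author><title>Smithing</title></book>",
}
index = build_index(records)
print(search(index, "smith/book/author"))
print(search(index, "smith*/book/title"))
```

Because the field path is part of the stored token, a search for a word in the `author` field does not match the same word in the `title` field.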
  • Publication number: 20070294282
    Abstract: In general, in one aspect, a method for dynamic markup language schema enumeration includes receiving a request for schema file data, the schema file data for validating markup language data, using a database query to retrieve from a database values in a database table that are valid values for the markup language data, and dynamically generating schema file data comprising markup language data structures generated in response to the retrieved database table values, the markup language data structures comprising the results of the database query.
    Type: Application
    Filed: May 16, 2007
    Publication date: December 20, 2007
    Inventor: Donald Cruver
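
The dynamic enumeration idea in the abstract above can be illustrated with a short sketch that queries a database table and emits an XML Schema `simpleType` whose allowed values are the rows returned. The use of SQLite, the table and column names, and the function name `enumeration_schema` are assumptions made for the example.

```python
import sqlite3
import xml.etree.ElementTree as ET

XS = "http://www.w3.org/2001/XMLSchema"


def enumeration_schema(conn, table, column, type_name):
    """Generate schema text whose enumeration values come from a database query."""
    # Assumption: table and column are trusted identifiers, not user input.
    rows = conn.execute(f"SELECT DISTINCT {column} FROM {table} ORDER BY {column}")
    ET.register_namespace("xs", XS)
    schema = ET.Element(f"{{{XS}}}schema")
    simple = ET.SubElement(schema, f"{{{XS}}}simpleType", {"name": type_name})
    restriction = ET.SubElement(simple, f"{{{XS}}}restriction", {"base": "xs:string"})
    for (value,) in rows:
        ET.SubElement(restriction, f"{{{XS}}}enumeration", {"value": str(value)})
    return ET.tostring(schema, encoding="unicode")


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE colors (name TEXT)")
conn.executemany(
    "INSERT INTO colors VALUES (?)", [("red",), ("green",), ("blue",), ("red",)]
)
schema_text = enumeration_schema(conn, "colors", "name", "ColorType")
print(schema_text)
```

Since the schema is regenerated on each request, adding a row to the table changes the set of values that validate without editing any schema file by hand.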