Structured Document Compression Patents (Class 715/242)
  • Patent number: 8346737
    Abstract: A compact binary encoding technique for information that is logically hierarchically structured, such as XML data, maintains all of the features of XML data in a useable form, such as the hierarchical structure underlying the data. Hence, data encoded in this format can undergo XML-based processing on-the-fly as it is being received or fetched, as if the data was being processed linearly in its textual character-based format. Processing of data encoded in this format can begin without having to wait for and decode the entire data set. The overhead due to XML tags is significantly minimized. The encoded data can be processed more efficiently because the data is pre-parsed. Values may be stored in their native type formats and, therefore, processing of the encoded data avoids costly type conversions. Further, any available structural constraint information can be effectively exploited.
    Type: Grant
    Filed: July 14, 2005
    Date of Patent: January 1, 2013
    Assignee: Oracle International Corporation
    Inventors: Ravi Murthy, Eric Sedlar, Dmitry Lenkov, Sivasankaran Chandrasekar, K. Karun, Anjana Manian, Olga Peschansky, Kwok Lun Alex Yiu
  • Patent number: 8315601
    Abstract: A system and method for processing attachments to messages sent to a mobile device is described herein. Embodiments described herein apply to encrypted messages comprising multiple message parts, in which different encryption keys (e.g. session keys) have been used to encrypt the different message parts. In at least one example embodiment, the encrypted session keys for every message content part comprising an attachment is received at the mobile device. In one embodiment, all of the encrypted session keys are stored together in a main message header. The mobile device may then decrypt the encrypted session key associated with a user requested attachment, and transmits the decrypted session key to one or more remote servers in an attachment request for use in decrypting the requested attachment. Data associated with the requested attachment, in decrypted form, is returned to the mobile device.
    Type: Grant
    Filed: April 30, 2012
    Date of Patent: November 20, 2012
    Assignee: Research In Motion Limited
    Inventors: Michael Kenneth Brown, Michael Grant Kirkup, Michael Stephen Brown
  • Patent number: 8312370
    Abstract: Described embodiments provide a system for assembly of structured documents having at least a memory, a processor, and a circuit. The memory stores documents. The processor identifies a template for generating a document. The template defines fields having a specified position and length within the document. The processor stores characters based on data associated with fields to locations of memory associated with the fields. Based partly on the text and length of the field, the processor stores indicators for portions of fields to be removed from the document. An indicator includes one or more specified character values, which may include a first byte value indicative of a start-of-heading character, and a second of byte value indicative of a number of characters to be removed. A circuit accesses the document from the memory and removes portions of fields based on the indicators to generate a portion of an assembled document.
    Type: Grant
    Filed: March 10, 2009
    Date of Patent: November 13, 2012
    Assignee: LSI Corporation
    Inventors: Eric T Lemoine, Tak Sze, Eric Shoquist, Mario Niebla, Michael Leventhal
  • Patent number: 8271868
    Abstract: An embodiment of the current invention is directed to inserting content from a source website into a destination website. Content from the source website is selected as selected content. A destination website is navigated to and, thereafter, a script is executed in a window displaying the destination website. In embodiments, the script performs a process that includes retrieving the selected content and displaying the selected content on the destination website.
    Type: Grant
    Filed: March 25, 2009
    Date of Patent: September 18, 2012
    Assignee: Microsoft Corporation
    Inventors: Aleksey Y. Sinyagin, Kevin James Riedy, Eric Jon Juvet
  • Patent number: 8255872
    Abstract: A method (300) of editing program code is proposed. The program code consists of main instructions written in a high-level language. Typically, the program code further includes one or more sets of service instructions, which are not directly connected to a desired algorithm (such as tracing instructions for debugging purposes, performance measuring instructions, and the like). In the solution of the invention, each set of service instructions is enclosed (321-322) between a pair of predefined comments defining a starting tag and an ending tag. An editor of the program code is provided with commands for automatically disabling (341) the service instructions (converting them into corresponding comments); at the same time, the disabled instructions are condensed (340) on the monitor. Additional commands are available for automatically enabling (368) and restoring (367) the service instructions.
    Type: Grant
    Filed: December 4, 2007
    Date of Patent: August 28, 2012
    Assignee: International Business Machines Corporation
    Inventors: Cosimo Vampo, Raimondo Costino, Eliseba Costantini
  • Patent number: 8244692
    Abstract: Various embodiments described herein provide systems, methods, software, and data structures that allow a page description language reader application, such as ADOBE® ACROBAT®, to provide a view of the assets compressed within a compressed archive. These and other embodiments including applications that may interleave one or more compressed archives within a page description language file, such as a PDF file. This interleaving leverages properties of both compressed archive files and page description language files to insert object markers around compressed assets within the compressed archives and references to at least some of the object markers in an index portion of the page description language files. As a result, compressed assets of compressed archives included in a page description language file package may be viewed within the listing of assets of the page description language file. These and other embodiments are described in greater detail below.
    Type: Grant
    Filed: June 30, 2010
    Date of Patent: August 14, 2012
    Assignee: Adobe Systems Incorporated
    Inventor: Philip Levy
  • Patent number: 8230327
    Abstract: A system which enables an administrator to conveniently specify statements of a web page description which may require additional processing. The administrator may select from a list of statement types, and a language grammar corresponding to the selected statement type is displayed. The administrator may provide variable data associated with the fields of the displayed language grammar, and a rule instance is created from the language grammar and the variable data. An intermediate server may determine whether each statement of a web page description matches any of the rule instances. If a match is detected, the corresponding statement is determined to potentially require additional processing (such as modification of a URL in a reverse proxy server).
    Type: Grant
    Filed: March 3, 2003
    Date of Patent: July 24, 2012
    Assignee: Oracle America, Inc.
    Inventors: Nagendra Kumar Raja, Thomas R. Mueller, Sanjib Ghosh
  • Patent number: 8230330
    Abstract: An apparatus which processes a flow description document detects a common part between the first and second flow description documents, and rewrites the common part in the second flow description document into a reference to that in the first flow description document.
    Type: Grant
    Filed: July 14, 2008
    Date of Patent: July 24, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventor: Shingo Iwasaki
  • Publication number: 20120151330
    Abstract: A method and apparatus of encoding and decoding an Extensible Markup Language (XML) document using a path code is provided. A method of encoding an XML document, the method including: searching the XML document for all element tags and all attributes including character data; extracting an XPath of each of the retrieved element tags and attributes; converting the extracted XPath into a predetermined path code; and expressing an occurrence indicator of each of all element tags included in the converted XPath.
    Type: Application
    Filed: October 1, 2009
    Publication date: June 14, 2012
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Joo Myoung Seok, Min-Sik Park, Han Kyu Lee, Jin Woo Hong
  • Patent number: 8200975
    Abstract: Systems and/or methods enabling an individual to digitally sign data using an identifier where the data being digitally signed and the identifier are not accessed at the same location are described. In some cases, the systems and/or methods permit an individual to digitally sign data for an electronic document using an identifier inaccessible by a remote computer where the electronic document is rendered using view information from that remote computer. The systems and/or methods may also enable a network computer to send view information for, and a digital encoding of, an electronic document to a local computer effective to enable the local computer to digitally sign the electronic document by signing the digital encoding of electronic document.
    Type: Grant
    Filed: June 29, 2005
    Date of Patent: June 12, 2012
    Assignee: Microsoft Corporation
    Inventors: Brian G. O'Connor, Kalpita S. Deobhakta, Silviu Ifrim
  • Patent number: 8184335
    Abstract: An overall processing time to rasterize, at the first device, the electronic document to be rendered is computed. Also, a rendering time to render, at the first device, the electronic document to be rendered is computed. When the overall processing time to rasterize at the first device is greater than the rendering time to render at the first device, the electronic document to be rendered is parsed into a first document and sub-documents. A productivity capacity of each node is determined, the productivity capacity being a measured of the processing power of the node and the communication cost of exchanging information between the first device and the node. A sub-document is rasterized at a node when a productivity capacity of the node reduces the processing time to rasterize the electronic document to be rendered to be less than the computed overall processing time.
    Type: Grant
    Filed: March 25, 2008
    Date of Patent: May 22, 2012
    Assignee: Xerox Corporation
    Inventors: Hua Liu, Steven J. Harrington
  • Patent number: 8161380
    Abstract: Provided are techniques for pipeline optimization based on polymorphic schema knowledge. A hierarchical document to be processed by a pipeline of transformations is received. It is determined whether a next downstream transformation accesses content of each schema node in an associated input schema, wherein the input schema is a polymorphic schema. In response to determining that the next downstream transformation is not accessing the content of each schema node in the associated input schema, data items corresponding to each schema node that the next downstream transformation is not accessing are collected into a single compressed event. The collected items are passed to the next downstream transformation as the single compressed event.
    Type: Grant
    Filed: June 26, 2008
    Date of Patent: April 17, 2012
    Assignee: International Business Machines Corporation
    Inventors: Amir Bar-Or, Michael James Beckerle
  • Patent number: 8131721
    Abstract: An information retrieval apparatus includes an acquiring unit that acquires a numerical value defining a boundary of a numerical range; a detecting unit that detects a number of places in and a head numeral of the numerical value; an extracting unit that extracts from a bit string group, a bit string indicating whether a numerical value in a numerical value group having the number of places and the head numeral is present in files subject to retrieval; a specifying unit that specifies a file corresponding to a bit in the extracted bit string, the bit indicating the presence of a numerical value of the numerical value group; a determining unit that determines whether a numerical value in the specified file meets the boundary condition; and a designating unit that, based on a determination by the determining unit designates the specified file to have a numerical value within the numerical range.
    Type: Grant
    Filed: April 6, 2009
    Date of Patent: March 6, 2012
    Assignee: Fujitsu Limited
    Inventors: Masahiro Kataoka, Hiroyuki Torii, Masahiro Kurishima, Hideo Kasai
  • Patent number: 8099663
    Abstract: A computer readable storage medium includes executable instructions to convert a destination document into a destination document tree hierarchy. The destination document tree hierarchy is flattened into a destination document hash table comprising a set of destination document keys and a set of destination document values. A source document is converted into a source document tree hierarchy. The source document tree hierarchy is flattened into a source document hash table comprising a set of source document key-value pairs. A source document key-value pair comprising a key and a value is identified, where the key is not in the set of destination document keys. The source document key-value is added to a changelist. A destination document key is identified, where the destination document key is not a key in a key-value pair in the set of source document key-value pairs. The destination document key is added to the changelist.
    Type: Grant
    Filed: July 19, 2007
    Date of Patent: January 17, 2012
    Assignee: Business Objects Software Ltd.
    Inventors: Ian Anthony Mayor, Victor Wai Kit Tang
  • Patent number: 8081093
    Abstract: A code transforming apparatus inputs therein a first codestream of a target file, and outputs a second codestream transformed from the first codestream. In the code transforming apparatus, a BOX decomposing unit extracts BOX information from the first codestream. A sharing/referencing determining unit determines whether sharing process or referencing process is to be performed on the BOX information. A sharing/referencing processing unit performs the sharing process or the referencing process on the BOX information based on a determination result from the sharing/referencing determining unit. The sharing/referencing processing unit can specify a logical set of codes on a referenced side as a unit of reference for referencing an incremental codestream.
    Type: Grant
    Filed: April 27, 2007
    Date of Patent: December 20, 2011
    Assignee: Ricoh Company, Ltd.
    Inventor: Akio Matsubara
  • Patent number: 8082226
    Abstract: In a distributed or multi-user system, it is possible for a user to try to access a media file that is not optimal for the user's purposes. In other words, the best quality version of some source material might not be the most optimal version for all users. For example, some users might use less network bandwidth if they used lower resolution video data. User context information is used to limit which versions of a source may be provided to a user that requests the source. The system interprets the user context information to provide the best quality version of the source for the user's purposes. Example user context information may include a specified working resolution and/or a specified target resolution for the user. The context information may be specified for the user or may be specified for a group of users, location of the user or other characteristic related to the user, from which the user in turn inherits the context information.
    Type: Grant
    Filed: October 3, 2008
    Date of Patent: December 20, 2011
    Assignee: Avid Technology, Inc.
    Inventors: Louis Ayotte, Donald Carignan, Lyne Champagne, Steven Lowell, Roger Sacilotto, Roger Tawa, Jr., Stéphane Daigle
  • Patent number: 8073824
    Abstract: A method of indexing multidimensional data obtains (302) a data set (210) describing features defined over at least one dimension other than time. Index data (212) comprising at least one value corresponding to the at least one non-time dimension is also obtained. The data set is compressed (306) and the index data is associated (308) with the compressed data set such that the compressed data set is identifiable by the index data.
    Type: Grant
    Filed: July 22, 2008
    Date of Patent: December 6, 2011
    Assignee: BAE Systems PLC
    Inventors: Jamil Appa, David Fin William Standingford
  • Publication number: 20110283183
    Abstract: The invention concerns a method for compressing and decompressing a structured document, associated with at least a tree diagram structure defining a document structure and comprising nested structure elements, associated with a type of information, and representing sets of data, the method comprising steps which consists in: performing a syntactic analysis of the structure diagram and standardizing it so as to obtain a single predefined sequence of the elements of the diagram; compiling the standardized diagram to obtain finite automata, each automaton comprising states interconnected by transitions respectively representing the elements of the structure; and compressing the document, and executing at least a compression algorithm associated with a type of information, when a set of data having the type of information is encountered in the document.
    Type: Application
    Filed: July 26, 2011
    Publication date: November 17, 2011
    Applicant: EXPWAY
    Inventors: Cedric Thienot, Claude Seyrat
  • Patent number: 8001133
    Abstract: A method of managing a contact list in a communication system includes: a) in response to user manipulation, generating a contact identification (ID) representing a contact person in the communication system, and presenting the contact ID on the contact list; b) keeping track of a contact frequency between the user and the contact person; c) selecting an attribute in accordance with the contact frequency; and d) presenting the attribute on the contact list, such that the attribute is associated with the contact ID. An apparatus for managing a contact list in a communication system is also disclosed.
    Type: Grant
    Filed: September 18, 2008
    Date of Patent: August 16, 2011
    Assignee: E-Ten Information Systems Co., Ltd
    Inventor: Po-Chun Chen
  • Patent number: 7987420
    Abstract: Described herein are systems, methods, computer program products, and combinations and sub-combinations thereof, for enabling web content (as well as other objects) to be loaded on mobile devices (as well as other types of devices), and for users of mobile devices to operate with such web content on their mobile devices in an interactive manner while in an off-line mode.
    Type: Grant
    Filed: November 6, 2000
    Date of Patent: July 26, 2011
    Assignee: iAnywhere Solutions, Inc.
    Inventors: David D. Kloba, Michael R. Gray, David M. Moore, Thomas E. Whittaker, David J. Williams, Rafael Z. Weinstein, Joshua E. Freeman, Linus M. Upson
  • Patent number: 7975220
    Abstract: The structured document management apparatus includes a document data accepting unit that accepts input of structured document data having a hierarchical logic structure; a structure guide data storage unit that stores structure guide data which is a summary of hierarchical structure information of the structured document data; a structure stream converting unit that syntax-analyzes the accepted structured document data, and converts the structure information in the structured document data into structure stream data as one-dimensional sequence data using the structure guide data; and a structure stream data storage unit that stores the converted structure stream data.
    Type: Grant
    Filed: September 21, 2006
    Date of Patent: July 5, 2011
    Assignee: Kabushiki Kaisha Toshiba
    Inventor: Masakazu Hattori
  • Publication number: 20110145700
    Abstract: When a structured document includes a compressed structure channel, a structured document analysis apparatus decompresses the structure channel. The structured document analysis apparatus generates an event list including structure information (events) of the structured document, and reference information (block numbers, channel numbers, value numbers) that refers to values. The structured document analysis apparatus notifies an application program of contents of the event list. Subsequently, when the user application requests a value, if the value is compressed, the structured document analysis apparatus decompress the value and notifies the application program of the value.
    Type: Application
    Filed: December 14, 2010
    Publication date: June 16, 2011
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Keisuke Tamiya
  • Publication number: 20110138270
    Abstract: The present invention provides a method and a device for processing a structured document by steps of obtaining an access mode for a consuming party of the structured document to an element in the structured document, the element comprising tag and content; determining a compression rule based on the access mode, the compression rule specifying at least one element to be compressed and at least one element not to be compressed in the structured document; and replacing the at least one element to be compressed with a compressed element to form a compressed structured document, wherein the tag of the compressed element is a specific compression tag, and the content of the compressed element is a result of compressing the at least one element to be compressed.
    Type: Application
    Filed: October 30, 2010
    Publication date: June 9, 2011
    Applicant: International Business Machines Corporation
    Inventors: Li Li, Qingbo Wang, Zhe Xiang, Yi Xin Zhao
  • Patent number: 7949355
    Abstract: A system and method for processing attachments to messages sent to a mobile device is described herein. Embodiments described herein apply to encrypted messages comprising multiple message parts, in which different encryption keys (e.g. session keys) have been used to encrypt the different message parts. In at least one example embodiment, the encrypted session keys for every message content part comprising an attachment is received at the mobile device. In one embodiment, all of the encrypted session keys are stored together in a main message header. The mobile device may then decrypt the encrypted session key associated with a user requested attachment, and transmits the decrypted session key to one or more remote servers in an attachment request for use in decrypting the requested attachment. Data associated with the requested attachment, in decrypted form, is returned to the mobile device.
    Type: Grant
    Filed: September 4, 2007
    Date of Patent: May 24, 2011
    Assignee: Research In Motion Limited
    Inventors: Michael K. Brown, Michael G. Kirkup, Michael S. Brown
  • Patent number: 7940929
    Abstract: A method, system and computer program product for processing documents containing restricted information. The method for processing the documents includes the step of identifying data fields in the document. The method further includes the steps of identifying a critical section containing data fields corresponding to restricted information, and dividing the document into a critical section and a non-critical section. Further, the method includes the step of processing the non-critical section at a minimally secure location.
    Type: Grant
    Filed: November 23, 2005
    Date of Patent: May 10, 2011
    Assignee: BeyondCore, Inc.
    Inventor: Arijit Sengupta
  • Patent number: 7930630
    Abstract: A parser uses SAX-like APIs for efficiency and provides DOM-like APIs for simplicity and convenience. The parser can read an XML document, for example, and parse its contents into a set of abstract information containers called nodes. The nodes are used to represent the structure and content of the document, which allows applications to read and manipulate the information in the document while not having to explicitly understand XML syntax. After a document is parsed, the nodes can be explored in any direction; they are not limited to straight-through text file processing.
    Type: Grant
    Filed: May 31, 2006
    Date of Patent: April 19, 2011
    Assignee: Microsoft Corporation
    Inventor: Abid Masood
  • Patent number: 7895515
    Abstract: A method for detecting indicators of misleading content in a markup language coded document is provided. The method includes extracting a set of tags from the markup language coded document. The method also includes combining tags to create a tag structure signature. The tag structure signature is configured to include a set of n-grams. Each of the set of n-grams includes at least two tags from the set of tags. The method further includes comparing the tag structure signature against a set of known bad tag structure signatures to determine similarity.
    Type: Grant
    Filed: February 28, 2007
    Date of Patent: February 22, 2011
    Assignee: Trend Micro Inc
    Inventors: Jonathan James Oliver, Hsin-Yi Chen, Guan-Liang Chen, Cheng Hsin Hsu
  • Patent number: 7895513
    Abstract: Methods and apparatus, including computer program products, for configuring an electronic document for a presentation environment, where the electronic document includes a plurality of information types. A display constraint for the presentation environment is determined based on text information in the electronic document. A size of an electronic document is calculated under a first set of conditions satisfying the display constraint. A determination is made whether the calculated size satisfies a processing constraint of the presentation environment. If the calculated size does not satisfy the processing constraint, a reduced representation of the electronic document is configured by reducing information of a secondary information type of the electronic document, such that the reduced representation satisfies both the display constraint and the processing constraint and retains spatial relationships of the text information in the electronic document.
    Type: Grant
    Filed: May 28, 2003
    Date of Patent: February 22, 2011
    Assignee: Adobe Systems Incorporated
    Inventors: Atul K. Puri, Susanta Sarkar, Ajay Pande
  • Patent number: 7890853
    Abstract: A markup language document (e.g. an XML document) may contain a first instance of a markup language element having an attribute with a first, platform-independent value, a second instance of the element having the attribute with a second, platform-specific value, and an indicator that the second value overrides the first. Executable code may replace both of the first and second instances with a single instance having the attribute with the second value. Alternatively the markup language document may contain a first instance of a markup language element having a first, platform-independent attribute, a second instance of the element having a second, platform-specific attribute different from the first attribute, and an indicator that the second instance extends the first. In this case executable code may replace both instances of the markup language element with a single instance having the first attribute and the second attribute.
    Type: Grant
    Filed: February 13, 2006
    Date of Patent: February 15, 2011
    Assignee: Nextair Corporation
    Inventors: Tim Neil, Steve Grenier, Paul Chalmers
  • Publication number: 20110016384
    Abstract: A method is provided for displaying a page of content contained in an optimized document. In some embodiments, an optimized document is obtained, where the optimized document contains at least one page of content. A compressed page record in the optimized document that corresponds to a page to be displayed may be located. The compressed page record may be decompressed using a decompression counterpart to a technique used to compress the page record. An unfiltering may be performed on each set of values in the page record according to a filter determined to be applied to the particular set of values. Thereafter, the decompressed content may be displayed.
    Type: Application
    Filed: September 24, 2010
    Publication date: January 20, 2011
    Inventors: Joshua Shagam, Robert L. Goodwin
  • Patent number: 7856345
    Abstract: A method for managing and/or producing an output, the method comprising the steps of providing one or more layouts, each layout being divided into a number of zones, the one or more layouts in combination at least substantially covering the output; providing a number of rendering elements, each being adapted to perform a function and/or an action; assigning one or more relations and/or one or more orderings between the rendering elements and the zones; processing the zones of the one or more layouts according to the one or more relations and/or the one or more orderings, and by means of the rendering elements; providing, for each of the zones, an output based on the rendering elements and the one or more relations and/or orderings; and collecting the output of each of the zones into a final output.
    Type: Grant
    Filed: November 21, 2002
    Date of Patent: December 21, 2010
    Assignee: Sitecore A/S
    Inventors: Jakob H. Christensen, Ole S. Thrane
  • Patent number: 7849403
    Abstract: Techniques are disclosed for efficiently updating rendered content (such as content of a Web page) using a “slow-loading” content element, such as a slow-loading image. A reference is embedded within the markup language notation for the content to be rendered, where this reference identifies the source of the slow-loading content element. Delivery of the slow-loading content therefore begins automatically, when the content is rendered. Event handling attributes are specified with the reference, where values of these attributes identify client-side logic to be invoked when the associated event occurs. If the server determines that the rendered content, or some portion thereof, should be asynchronously updated, it abruptly terminates delivery of the slow-loading content. This termination triggers an event handler, which operates to automatically request reloading of the content.
    Type: Grant
    Filed: August 1, 2007
    Date of Patent: December 7, 2010
    Assignee: International Business Machines Corporation
    Inventors: Niraj P. Joshi, Robert C. Leah, Paul F. McMahan
  • Patent number: 7836396
    Abstract: A style compressor for improving compression of a web document collects all the style attributes referenced by elements of the web document into a central style repository. The style compressor replaces each reference by each of the elements with a class attribute assigned in the central style repository to the replaced style attributes. Next, the style compressor organizes the style attributes in the central style repository and compresses the style attributes into a minimized selection of class attributes each referencing at least one of the style attributes. The style compressor updates the elements of the web document to reference one or more of the minimized selection of class attributes in the central style repository to produce a shorter, more efficiently organized web document.
    Type: Grant
    Filed: January 5, 2007
    Date of Patent: November 16, 2010
    Assignee: International Business Machines Corporation
    Inventor: Daniel O. Becker
  • Patent number: 7823064
    Abstract: Various embodiments described herein provide systems, methods, software, and data structures that allow a page description language reader application, such as Adobe® Acrobat®, to provide a view of the assets compressed within a compressed archive. These and other embodiments including applications that may interleave one or more compressed archives within a page description language file, such as a PDF file. This interleaving leverages properties of both compressed archive files and page description language files to insert object markers around compressed assets within the compressed archives and references to at least some of the object markers in an index portion of the page description language files. As a result, compressed assets of compressed archives included in a page description language file package may be viewed within the listing of assets of the page description language file. These and other embodiments are described in greater detail below.
    Type: Grant
    Filed: August 30, 2007
    Date of Patent: October 26, 2010
    Assignee: Adobe Systems Incorporated
    Inventor: Philip Levy
  • Patent number: 7818664
    Abstract: The present invention is a computer software product and method including an XML parsing module, a translation module communicatively coupled to the XML parsing module and an output module communicatively coupled to the translation module, the output module configurable by the end user to execute a predefined event responsive to the processing of an associated XML tag. The predefined event may be selected from a group consisting of audio output modification, visual output modification, and tactile output modification. These output modifications may include, but are not limited to, voice inflection, voice gender, voice volume, voice pitch, voice speed, visual display modification, visual display cues, tactile device output modification and the like.
    Type: Grant
    Filed: March 16, 2005
    Date of Patent: October 19, 2010
    Assignee: Freedom Scientific, Inc.
    Inventors: Glen Gordon, Christian Hofstader, Joseph Stephen, Eric Damery
  • Patent number: 7810026
    Abstract: A method for optimizing a source document comprising a plurality of pages of content, comprising each of the following, is presented. A source document is obtained. An optimized document is created corresponding to the source document. Thereafter, for each page in the source document, the following are applied. A page record is created for the page. Each page record comprises a word table comprising a list of the page's words in the order that they appear in the page's content. Each page record further comprises a paragraph entry list for the page including a paragraph entry for each paragraph in the page. Each paragraph entry includes a reference to the first and last word of that paragraph in the word table. The page record is compressed using a compression technique. Thereafter, the compressed page record is stored in the optimized document.
    Type: Grant
    Filed: September 29, 2006
    Date of Patent: October 5, 2010
    Assignee: Amazon Technologies, Inc.
    Inventors: Joshua Shagam, Robert L Goodwin
  • Patent number: 7769904
    Abstract: An extensible binary mark-up language is disclosed that is compatible with existing XML standards yet provides significantly improved efficiencies for XML-based data storage and communications, particularly for narrow and low bandwidth communication media. A corresponding extensible non-binary mark-up language is also disclosed that is compatible with the XML standard. This dual-representation common message format (CMF) allows standard XML tools to be utilized in viewing and editing XML-based data and allows a CMF parser to be utilized to convert the XML formatted information into an extensible binary representation for actual communication through a medium or storage on a wide range of media.
    Type: Grant
    Filed: June 9, 2004
    Date of Patent: August 3, 2010
    Assignee: L-3 Communications Integrated Systems L.P.
    Inventor: Bill J. Eller
  • Patent number: 7742048
    Abstract: A method, system, and apparatus are provided for converting numbers to text and to digit shapes expressed in different languages. When a string of text is entered into an application program, the string is analyzed to determine whether the string of text includes a number expressed using digits. If the string of text includes a number expressed using digits, the number is semantically labeled with schema information. When a selection is received of the number, a list of actions may be provided identifying conversion actions available for the number. One of the conversion options may then be selected resulting in the conversion of the selected number to text in one or more languages. A conversion option may also be selected for converting the number digits to digits expressed using other digit shapes. The converted number may then be inserted into the string of text to replace the selected number.
    Type: Grant
    Filed: May 23, 2002
    Date of Patent: June 22, 2010
    Assignee: Microsoft Corporation
    Inventors: Thomas G. Moore, Mohamed A. Abbar, Nader G. Issa
  • Patent number: 7743321
    Abstract: A method of processing data associated with processing options, wherein selected processing options select variants within the data and selected variants are activated, and variants associated with unselected options can be selected in a further iteration of the process.
    Type: Grant
    Filed: October 28, 2005
    Date of Patent: June 22, 2010
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Roger Brian Gimson
  • Patent number: 7721199
    Abstract: Two files are configured for more rapid comparison by removing blank spaces from the code of all of the lines in each of the files to be compared and also by attaching a control to the beginning of each line of code in each of the two sources prior to be compared. After blank spaces are removed and the controls attached, the two files are compared to determine additions, deletions and changes. The program continues to compare the strings of identical length until all of the strings have been compared. By comparing only strings of identical length, the program is able to compare two sources in substantially less time than by using conventional means of comparison.
    Type: Grant
    Filed: March 3, 2006
    Date of Patent: May 18, 2010
    Assignee: International Business Machines Corporation
    Inventor: Charles Gordon Hooks
  • Patent number: 7721204
    Abstract: Methods and systems that dynamically control and/or direct document image processing based on one or more of client device display capabilites, image formats supported by the client device, network bandwidth, document file format, document content and user hints.
    Type: Grant
    Filed: July 29, 2004
    Date of Patent: May 18, 2010
    Assignee: Xerox Corporation
    Inventors: Reiner Eschbach, Emil V. Rainero
  • Patent number: 7693848
    Abstract: A method and apparatus is provided for converting a document in a first format essentially comprising a flat layout structure into a structured document in a hierarchical form in accordance with predetermined attributes identified from the input format. The process comprises fragmenting the input document into a plurality of document content elements in accordance with a predetermined set of document attributes identifiable from the input document format. The content elements are clustered into selective sets having similar document attributes. The clustered sets are validated with reference to common textual properties organizational content common in documents in the collection. The clustered sets are then categorized into predetermined categories comprising structured elements of the structured document format and the document content elements are organized by hierarchical dependency from the predetermined categories wherein the organized document elements comprise the desired structured document format.
    Type: Grant
    Filed: January 10, 2005
    Date of Patent: April 6, 2010
    Assignee: Xerox Corporation
    Inventors: Hervé Déjean, Veronika Lux, Sandrine Ribeau
  • Publication number: 20100083101
    Abstract: A structured document is coded as a bit stream. The structured document can be described in Extensible Markup Language (XML). The structured document can be decoded to access a subpart of the document. Hierarchized events included in the structured document are coded as corresponding events using at least one encoding table, which is updated during the coding. Hierarchized events involved in an update of the at least one encoding table are determined. Marking information is associated with the coded events corresponding to the hierarchized events determined to be involved in the update of the at least one encoding table.
    Type: Application
    Filed: September 30, 2009
    Publication date: April 1, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventors: Franck Denoual, Herve Ruellan
  • Publication number: 20100060935
    Abstract: The image data processor generates storage layout information and output joint information, and then generates a page raster for storage based on the storage layout information and a page raster for output based on the output joint information. The storage layout information is based on layout information describing object layouts and includes a page formed by the same objects and reusable objects of the same arrangements and a page formed by each variable object. The output joint information combines a page for reusable objects and a page for the variable object.
    Type: Application
    Filed: August 12, 2009
    Publication date: March 11, 2010
    Applicant: FUJIFILM CORPORATION
    Inventors: Mitsuru NAKAO, Nobuyuki SHITARA
  • Patent number: 7676742
    Abstract: A system and method for processing of markup language information, such as extensible markup language (XML) based control information is disclosed. The system comprises a compression sender entity and a decompression receiver entity. The sender entity uses its internal, binary representation derived from textual XML-based protocol header and encodes the binary representation as a tag-length-value (TLV) binary value. The TLV binary value is embedded as encoded data in a specifically marked comment, which may be detected by the receiver entity. The receiver entity builds up an internal, binary data structure based on the encoded data representing the original XML data. The system avoids the usage of string data to represent a small set of well known strings and uses a more compact binary encoding which reduces space and time to process the data.
    Type: Grant
    Filed: July 17, 2004
    Date of Patent: March 9, 2010
    Assignee: International Business Machines Corporation
    Inventors: Carl Binding, Reto Hermann, Andreas Schade
  • Patent number: 7669120
    Abstract: A method and system for encoding a mark-up language document is provided, in which, the structure of the mark-up language document is condensed by removing those parts of the structure that are fixed, and by expressing the variable parts of the structure in terms of whether or how often they occur. A template that specifies which parts of the structure are fixed and which parts are variable may be provided to both the device that is transmitting the mark-up language document and the device that is receiving the mark-up language document.
    Type: Grant
    Filed: June 21, 2002
    Date of Patent: February 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Hongbin Liao, Qian Zhang, Wenwu Zhu
  • Patent number: 7653752
    Abstract: A script is scanned to detect character strings that match predetermined criteria. The detected character strings are sorted in order of their appearance frequency and are associated with substitute characters or character strings such that a character string having a greater appearance frequency than another character string is associated with substitute characters or character string having a same or smaller number of characters. The script is searched for a given character string, and the given character string is replaced with its associated substitute characters or character strings. The content is unidirectionally distributed whereby the plurality of character strings is not distributed.
    Type: Grant
    Filed: September 25, 2007
    Date of Patent: January 26, 2010
    Assignee: Sony Corporation
    Inventor: Yoshiharu Dewa
  • Patent number: 7647552
    Abstract: Disclosed is a method (900) for communicating at least part of a structure of a document (104) described by a hierarchical representation (102). The method identifies (902) the hierarchical representation (eg. the tree structure) of the document (104). The identification is preferably performed using XML tags. The representation is then packetized (906) into a plurality of data packets. At least one link is then created (908) between a pair of the packets, the link acting to represent an interconnection between corresponding components (eg. structure and content) of the representation. The packets are then formed (910) into a stream for communication. The links maintain the hierachical representation within the packets.
    Type: Grant
    Filed: October 5, 2001
    Date of Patent: January 12, 2010
    Assignee: Canon Kabushiki Kaisha
    Inventor: Ernest Yiu Cheong Wan
  • Publication number: 20090327870
    Abstract: Provided are techniques for pipeline optimization based on polymorphic schema knowledge. A hierarchical document to be processed by a pipeline of transformations is received. It is determined whether a next downstream transformation accesses content of each schema node in an associated input schema, wherein the input schema is a polymorphic schema. In response to determining that the next downstream transformation is not accessing the content of each schema node in the associated input schema, data items corresponding to each schema node that the next downstream transformation is not accessing are collected into a single compressed event. The collected items are passed to the next downstream transformation as the single compressed event.
    Type: Application
    Filed: June 26, 2008
    Publication date: December 31, 2009
    Applicant: International Business Machines Corporation
    Inventors: Amir BAR-OR, Michael James BECKERLE
  • Patent number: 7640492
    Abstract: Methods and structures for parsing an Extensible Markup Language (XML) data stream are described. In the described embodiment, one or more schema modules are defined and are associated with HTTP requests that are received from a client. The schema module(s) has a function that determines whether an XML data stream conforms to a given schema that is associated with the HTTP request. If a portion of the XML data stream does not conform to the given schema, then the schema module disregards that XML data stream portion. In the described embodiment, each schema module has a plurality of states associated therewith. Each state is associated with one or more schema requirements that relate to a particular element that is evaluated by the schema module. Each state is different from the other states and the number of states is a function of the number of layers or elements that are contained within a particular XML data stream.
    Type: Grant
    Filed: July 6, 2005
    Date of Patent: December 29, 2009
    Assignee: Microsoft Corporation
    Inventors: Joel M. Soderberg, Brian J. Deen