Structured Document Compression Patents (Class 715/242)
-
Patent number: 8346737Abstract: A compact binary encoding technique for information that is logically hierarchically structured, such as XML data, maintains all of the features of XML data in a useable form, such as the hierarchical structure underlying the data. Hence, data encoded in this format can undergo XML-based processing on-the-fly as it is being received or fetched, as if the data was being processed linearly in its textual character-based format. Processing of data encoded in this format can begin without having to wait for and decode the entire data set. The overhead due to XML tags is significantly minimized. The encoded data can be processed more efficiently because the data is pre-parsed. Values may be stored in their native type formats and, therefore, processing of the encoded data avoids costly type conversions. Further, any available structural constraint information can be effectively exploited.Type: GrantFiled: July 14, 2005Date of Patent: January 1, 2013Assignee: Oracle International CorporationInventors: Ravi Murthy, Eric Sedlar, Dmitry Lenkov, Sivasankaran Chandrasekar, K. Karun, Anjana Manian, Olga Peschansky, Kwok Lun Alex Yiu
-
Patent number: 8315601Abstract: A system and method for processing attachments to messages sent to a mobile device is described herein. Embodiments described herein apply to encrypted messages comprising multiple message parts, in which different encryption keys (e.g. session keys) have been used to encrypt the different message parts. In at least one example embodiment, the encrypted session keys for every message content part comprising an attachment is received at the mobile device. In one embodiment, all of the encrypted session keys are stored together in a main message header. The mobile device may then decrypt the encrypted session key associated with a user requested attachment, and transmits the decrypted session key to one or more remote servers in an attachment request for use in decrypting the requested attachment. Data associated with the requested attachment, in decrypted form, is returned to the mobile device.Type: GrantFiled: April 30, 2012Date of Patent: November 20, 2012Assignee: Research In Motion LimitedInventors: Michael Kenneth Brown, Michael Grant Kirkup, Michael Stephen Brown
-
Patent number: 8312370Abstract: Described embodiments provide a system for assembly of structured documents having at least a memory, a processor, and a circuit. The memory stores documents. The processor identifies a template for generating a document. The template defines fields having a specified position and length within the document. The processor stores characters based on data associated with fields to locations of memory associated with the fields. Based partly on the text and length of the field, the processor stores indicators for portions of fields to be removed from the document. An indicator includes one or more specified character values, which may include a first byte value indicative of a start-of-heading character, and a second of byte value indicative of a number of characters to be removed. A circuit accesses the document from the memory and removes portions of fields based on the indicators to generate a portion of an assembled document.Type: GrantFiled: March 10, 2009Date of Patent: November 13, 2012Assignee: LSI CorporationInventors: Eric T Lemoine, Tak Sze, Eric Shoquist, Mario Niebla, Michael Leventhal
-
Patent number: 8271868Abstract: An embodiment of the current invention is directed to inserting content from a source website into a destination website. Content from the source website is selected as selected content. A destination website is navigated to and, thereafter, a script is executed in a window displaying the destination website. In embodiments, the script performs a process that includes retrieving the selected content and displaying the selected content on the destination website.Type: GrantFiled: March 25, 2009Date of Patent: September 18, 2012Assignee: Microsoft CorporationInventors: Aleksey Y. Sinyagin, Kevin James Riedy, Eric Jon Juvet
-
Patent number: 8255872Abstract: A method (300) of editing program code is proposed. The program code consists of main instructions written in a high-level language. Typically, the program code further includes one or more sets of service instructions, which are not directly connected to a desired algorithm (such as tracing instructions for debugging purposes, performance measuring instructions, and the like). In the solution of the invention, each set of service instructions is enclosed (321-322) between a pair of predefined comments defining a starting tag and an ending tag. An editor of the program code is provided with commands for automatically disabling (341) the service instructions (converting them into corresponding comments); at the same time, the disabled instructions are condensed (340) on the monitor. Additional commands are available for automatically enabling (368) and restoring (367) the service instructions.Type: GrantFiled: December 4, 2007Date of Patent: August 28, 2012Assignee: International Business Machines CorporationInventors: Cosimo Vampo, Raimondo Costino, Eliseba Costantini
-
Patent number: 8244692Abstract: Various embodiments described herein provide systems, methods, software, and data structures that allow a page description language reader application, such as ADOBE® ACROBAT®, to provide a view of the assets compressed within a compressed archive. These and other embodiments including applications that may interleave one or more compressed archives within a page description language file, such as a PDF file. This interleaving leverages properties of both compressed archive files and page description language files to insert object markers around compressed assets within the compressed archives and references to at least some of the object markers in an index portion of the page description language files. As a result, compressed assets of compressed archives included in a page description language file package may be viewed within the listing of assets of the page description language file. These and other embodiments are described in greater detail below.Type: GrantFiled: June 30, 2010Date of Patent: August 14, 2012Assignee: Adobe Systems IncorporatedInventor: Philip Levy
-
Patent number: 8230327Abstract: A system which enables an administrator to conveniently specify statements of a web page description which may require additional processing. The administrator may select from a list of statement types, and a language grammar corresponding to the selected statement type is displayed. The administrator may provide variable data associated with the fields of the displayed language grammar, and a rule instance is created from the language grammar and the variable data. An intermediate server may determine whether each statement of a web page description matches any of the rule instances. If a match is detected, the corresponding statement is determined to potentially require additional processing (such as modification of a URL in a reverse proxy server).Type: GrantFiled: March 3, 2003Date of Patent: July 24, 2012Assignee: Oracle America, Inc.Inventors: Nagendra Kumar Raja, Thomas R. Mueller, Sanjib Ghosh
-
Patent number: 8230330Abstract: An apparatus which processes a flow description document detects a common part between the first and second flow description documents, and rewrites the common part in the second flow description document into a reference to that in the first flow description document.Type: GrantFiled: July 14, 2008Date of Patent: July 24, 2012Assignee: Canon Kabushiki KaishaInventor: Shingo Iwasaki
-
Publication number: 20120151330Abstract: A method and apparatus of encoding and decoding an Extensible Markup Language (XML) document using a path code is provided. A method of encoding an XML document, the method including: searching the XML document for all element tags and all attributes including character data; extracting an XPath of each of the retrieved element tags and attributes; converting the extracted XPath into a predetermined path code; and expressing an occurrence indicator of each of all element tags included in the converted XPath.Type: ApplicationFiled: October 1, 2009Publication date: June 14, 2012Applicant: Electronics and Telecommunications Research InstituteInventors: Joo Myoung Seok, Min-Sik Park, Han Kyu Lee, Jin Woo Hong
-
Patent number: 8200975Abstract: Systems and/or methods enabling an individual to digitally sign data using an identifier where the data being digitally signed and the identifier are not accessed at the same location are described. In some cases, the systems and/or methods permit an individual to digitally sign data for an electronic document using an identifier inaccessible by a remote computer where the electronic document is rendered using view information from that remote computer. The systems and/or methods may also enable a network computer to send view information for, and a digital encoding of, an electronic document to a local computer effective to enable the local computer to digitally sign the electronic document by signing the digital encoding of electronic document.Type: GrantFiled: June 29, 2005Date of Patent: June 12, 2012Assignee: Microsoft CorporationInventors: Brian G. O'Connor, Kalpita S. Deobhakta, Silviu Ifrim
-
Patent number: 8184335Abstract: An overall processing time to rasterize, at the first device, the electronic document to be rendered is computed. Also, a rendering time to render, at the first device, the electronic document to be rendered is computed. When the overall processing time to rasterize at the first device is greater than the rendering time to render at the first device, the electronic document to be rendered is parsed into a first document and sub-documents. A productivity capacity of each node is determined, the productivity capacity being a measured of the processing power of the node and the communication cost of exchanging information between the first device and the node. A sub-document is rasterized at a node when a productivity capacity of the node reduces the processing time to rasterize the electronic document to be rendered to be less than the computed overall processing time.Type: GrantFiled: March 25, 2008Date of Patent: May 22, 2012Assignee: Xerox CorporationInventors: Hua Liu, Steven J. Harrington
-
Patent number: 8161380Abstract: Provided are techniques for pipeline optimization based on polymorphic schema knowledge. A hierarchical document to be processed by a pipeline of transformations is received. It is determined whether a next downstream transformation accesses content of each schema node in an associated input schema, wherein the input schema is a polymorphic schema. In response to determining that the next downstream transformation is not accessing the content of each schema node in the associated input schema, data items corresponding to each schema node that the next downstream transformation is not accessing are collected into a single compressed event. The collected items are passed to the next downstream transformation as the single compressed event.Type: GrantFiled: June 26, 2008Date of Patent: April 17, 2012Assignee: International Business Machines CorporationInventors: Amir Bar-Or, Michael James Beckerle
-
Patent number: 8131721Abstract: An information retrieval apparatus includes an acquiring unit that acquires a numerical value defining a boundary of a numerical range; a detecting unit that detects a number of places in and a head numeral of the numerical value; an extracting unit that extracts from a bit string group, a bit string indicating whether a numerical value in a numerical value group having the number of places and the head numeral is present in files subject to retrieval; a specifying unit that specifies a file corresponding to a bit in the extracted bit string, the bit indicating the presence of a numerical value of the numerical value group; a determining unit that determines whether a numerical value in the specified file meets the boundary condition; and a designating unit that, based on a determination by the determining unit designates the specified file to have a numerical value within the numerical range.Type: GrantFiled: April 6, 2009Date of Patent: March 6, 2012Assignee: Fujitsu LimitedInventors: Masahiro Kataoka, Hiroyuki Torii, Masahiro Kurishima, Hideo Kasai
-
Patent number: 8099663Abstract: A computer readable storage medium includes executable instructions to convert a destination document into a destination document tree hierarchy. The destination document tree hierarchy is flattened into a destination document hash table comprising a set of destination document keys and a set of destination document values. A source document is converted into a source document tree hierarchy. The source document tree hierarchy is flattened into a source document hash table comprising a set of source document key-value pairs. A source document key-value pair comprising a key and a value is identified, where the key is not in the set of destination document keys. The source document key-value is added to a changelist. A destination document key is identified, where the destination document key is not a key in a key-value pair in the set of source document key-value pairs. The destination document key is added to the changelist.Type: GrantFiled: July 19, 2007Date of Patent: January 17, 2012Assignee: Business Objects Software Ltd.Inventors: Ian Anthony Mayor, Victor Wai Kit Tang
-
Patent number: 8081093Abstract: A code transforming apparatus inputs therein a first codestream of a target file, and outputs a second codestream transformed from the first codestream. In the code transforming apparatus, a BOX decomposing unit extracts BOX information from the first codestream. A sharing/referencing determining unit determines whether sharing process or referencing process is to be performed on the BOX information. A sharing/referencing processing unit performs the sharing process or the referencing process on the BOX information based on a determination result from the sharing/referencing determining unit. The sharing/referencing processing unit can specify a logical set of codes on a referenced side as a unit of reference for referencing an incremental codestream.Type: GrantFiled: April 27, 2007Date of Patent: December 20, 2011Assignee: Ricoh Company, Ltd.Inventor: Akio Matsubara
-
Patent number: 8082226Abstract: In a distributed or multi-user system, it is possible for a user to try to access a media file that is not optimal for the user's purposes. In other words, the best quality version of some source material might not be the most optimal version for all users. For example, some users might use less network bandwidth if they used lower resolution video data. User context information is used to limit which versions of a source may be provided to a user that requests the source. The system interprets the user context information to provide the best quality version of the source for the user's purposes. Example user context information may include a specified working resolution and/or a specified target resolution for the user. The context information may be specified for the user or may be specified for a group of users, location of the user or other characteristic related to the user, from which the user in turn inherits the context information.Type: GrantFiled: October 3, 2008Date of Patent: December 20, 2011Assignee: Avid Technology, Inc.Inventors: Louis Ayotte, Donald Carignan, Lyne Champagne, Steven Lowell, Roger Sacilotto, Roger Tawa, Jr., Stéphane Daigle
-
Patent number: 8073824Abstract: A method of indexing multidimensional data obtains (302) a data set (210) describing features defined over at least one dimension other than time. Index data (212) comprising at least one value corresponding to the at least one non-time dimension is also obtained. The data set is compressed (306) and the index data is associated (308) with the compressed data set such that the compressed data set is identifiable by the index data.Type: GrantFiled: July 22, 2008Date of Patent: December 6, 2011Assignee: BAE Systems PLCInventors: Jamil Appa, David Fin William Standingford
-
Publication number: 20110283183Abstract: The invention concerns a method for compressing and decompressing a structured document, associated with at least a tree diagram structure defining a document structure and comprising nested structure elements, associated with a type of information, and representing sets of data, the method comprising steps which consists in: performing a syntactic analysis of the structure diagram and standardizing it so as to obtain a single predefined sequence of the elements of the diagram; compiling the standardized diagram to obtain finite automata, each automaton comprising states interconnected by transitions respectively representing the elements of the structure; and compressing the document, and executing at least a compression algorithm associated with a type of information, when a set of data having the type of information is encountered in the document.Type: ApplicationFiled: July 26, 2011Publication date: November 17, 2011Applicant: EXPWAYInventors: Cedric Thienot, Claude Seyrat
-
Patent number: 8001133Abstract: A method of managing a contact list in a communication system includes: a) in response to user manipulation, generating a contact identification (ID) representing a contact person in the communication system, and presenting the contact ID on the contact list; b) keeping track of a contact frequency between the user and the contact person; c) selecting an attribute in accordance with the contact frequency; and d) presenting the attribute on the contact list, such that the attribute is associated with the contact ID. An apparatus for managing a contact list in a communication system is also disclosed.Type: GrantFiled: September 18, 2008Date of Patent: August 16, 2011Assignee: E-Ten Information Systems Co., LtdInventor: Po-Chun Chen
-
Patent number: 7987420Abstract: Described herein are systems, methods, computer program products, and combinations and sub-combinations thereof, for enabling web content (as well as other objects) to be loaded on mobile devices (as well as other types of devices), and for users of mobile devices to operate with such web content on their mobile devices in an interactive manner while in an off-line mode.Type: GrantFiled: November 6, 2000Date of Patent: July 26, 2011Assignee: iAnywhere Solutions, Inc.Inventors: David D. Kloba, Michael R. Gray, David M. Moore, Thomas E. Whittaker, David J. Williams, Rafael Z. Weinstein, Joshua E. Freeman, Linus M. Upson
-
Patent number: 7975220Abstract: The structured document management apparatus includes a document data accepting unit that accepts input of structured document data having a hierarchical logic structure; a structure guide data storage unit that stores structure guide data which is a summary of hierarchical structure information of the structured document data; a structure stream converting unit that syntax-analyzes the accepted structured document data, and converts the structure information in the structured document data into structure stream data as one-dimensional sequence data using the structure guide data; and a structure stream data storage unit that stores the converted structure stream data.Type: GrantFiled: September 21, 2006Date of Patent: July 5, 2011Assignee: Kabushiki Kaisha ToshibaInventor: Masakazu Hattori
-
Publication number: 20110145700Abstract: When a structured document includes a compressed structure channel, a structured document analysis apparatus decompresses the structure channel. The structured document analysis apparatus generates an event list including structure information (events) of the structured document, and reference information (block numbers, channel numbers, value numbers) that refers to values. The structured document analysis apparatus notifies an application program of contents of the event list. Subsequently, when the user application requests a value, if the value is compressed, the structured document analysis apparatus decompress the value and notifies the application program of the value.Type: ApplicationFiled: December 14, 2010Publication date: June 16, 2011Applicant: CANON KABUSHIKI KAISHAInventor: Keisuke Tamiya
-
Publication number: 20110138270Abstract: The present invention provides a method and a device for processing a structured document by steps of obtaining an access mode for a consuming party of the structured document to an element in the structured document, the element comprising tag and content; determining a compression rule based on the access mode, the compression rule specifying at least one element to be compressed and at least one element not to be compressed in the structured document; and replacing the at least one element to be compressed with a compressed element to form a compressed structured document, wherein the tag of the compressed element is a specific compression tag, and the content of the compressed element is a result of compressing the at least one element to be compressed.Type: ApplicationFiled: October 30, 2010Publication date: June 9, 2011Applicant: International Business Machines CorporationInventors: Li Li, Qingbo Wang, Zhe Xiang, Yi Xin Zhao
-
Patent number: 7949355Abstract: A system and method for processing attachments to messages sent to a mobile device is described herein. Embodiments described herein apply to encrypted messages comprising multiple message parts, in which different encryption keys (e.g. session keys) have been used to encrypt the different message parts. In at least one example embodiment, the encrypted session keys for every message content part comprising an attachment is received at the mobile device. In one embodiment, all of the encrypted session keys are stored together in a main message header. The mobile device may then decrypt the encrypted session key associated with a user requested attachment, and transmits the decrypted session key to one or more remote servers in an attachment request for use in decrypting the requested attachment. Data associated with the requested attachment, in decrypted form, is returned to the mobile device.Type: GrantFiled: September 4, 2007Date of Patent: May 24, 2011Assignee: Research In Motion LimitedInventors: Michael K. Brown, Michael G. Kirkup, Michael S. Brown
-
Patent number: 7940929Abstract: A method, system and computer program product for processing documents containing restricted information. The method for processing the documents includes the step of identifying data fields in the document. The method further includes the steps of identifying a critical section containing data fields corresponding to restricted information, and dividing the document into a critical section and a non-critical section. Further, the method includes the step of processing the non-critical section at a minimally secure location.Type: GrantFiled: November 23, 2005Date of Patent: May 10, 2011Assignee: BeyondCore, Inc.Inventor: Arijit Sengupta
-
Patent number: 7930630Abstract: A parser uses SAX-like APIs for efficiency and provides DOM-like APIs for simplicity and convenience. The parser can read an XML document, for example, and parse its contents into a set of abstract information containers called nodes. The nodes are used to represent the structure and content of the document, which allows applications to read and manipulate the information in the document while not having to explicitly understand XML syntax. After a document is parsed, the nodes can be explored in any direction; they are not limited to straight-through text file processing.Type: GrantFiled: May 31, 2006Date of Patent: April 19, 2011Assignee: Microsoft CorporationInventor: Abid Masood
-
Patent number: 7895515Abstract: A method for detecting indicators of misleading content in a markup language coded document is provided. The method includes extracting a set of tags from the markup language coded document. The method also includes combining tags to create a tag structure signature. The tag structure signature is configured to include a set of n-grams. Each of the set of n-grams includes at least two tags from the set of tags. The method further includes comparing the tag structure signature against a set of known bad tag structure signatures to determine similarity.Type: GrantFiled: February 28, 2007Date of Patent: February 22, 2011Assignee: Trend Micro IncInventors: Jonathan James Oliver, Hsin-Yi Chen, Guan-Liang Chen, Cheng Hsin Hsu
-
Patent number: 7895513Abstract: Methods and apparatus, including computer program products, for configuring an electronic document for a presentation environment, where the electronic document includes a plurality of information types. A display constraint for the presentation environment is determined based on text information in the electronic document. A size of an electronic document is calculated under a first set of conditions satisfying the display constraint. A determination is made whether the calculated size satisfies a processing constraint of the presentation environment. If the calculated size does not satisfy the processing constraint, a reduced representation of the electronic document is configured by reducing information of a secondary information type of the electronic document, such that the reduced representation satisfies both the display constraint and the processing constraint and retains spatial relationships of the text information in the electronic document.Type: GrantFiled: May 28, 2003Date of Patent: February 22, 2011Assignee: Adobe Systems IncorporatedInventors: Atul K. Puri, Susanta Sarkar, Ajay Pande
-
Patent number: 7890853Abstract: A markup language document (e.g. an XML document) may contain a first instance of a markup language element having an attribute with a first, platform-independent value, a second instance of the element having the attribute with a second, platform-specific value, and an indicator that the second value overrides the first. Executable code may replace both of the first and second instances with a single instance having the attribute with the second value. Alternatively the markup language document may contain a first instance of a markup language element having a first, platform-independent attribute, a second instance of the element having a second, platform-specific attribute different from the first attribute, and an indicator that the second instance extends the first. In this case executable code may replace both instances of the markup language element with a single instance having the first attribute and the second attribute.Type: GrantFiled: February 13, 2006Date of Patent: February 15, 2011Assignee: Nextair CorporationInventors: Tim Neil, Steve Grenier, Paul Chalmers
-
Publication number: 20110016384Abstract: A method is provided for displaying a page of content contained in an optimized document. In some embodiments, an optimized document is obtained, where the optimized document contains at least one page of content. A compressed page record in the optimized document that corresponds to a page to be displayed may be located. The compressed page record may be decompressed using a decompression counterpart to a technique used to compress the page record. An unfiltering may be performed on each set of values in the page record according to a filter determined to be applied to the particular set of values. Thereafter, the decompressed content may be displayed.Type: ApplicationFiled: September 24, 2010Publication date: January 20, 2011Inventors: Joshua Shagam, Robert L. Goodwin
-
Patent number: 7856345Abstract: A method for managing and/or producing an output, the method comprising the steps of providing one or more layouts, each layout being divided into a number of zones, the one or more layouts in combination at least substantially covering the output; providing a number of rendering elements, each being adapted to perform a function and/or an action; assigning one or more relations and/or one or more orderings between the rendering elements and the zones; processing the zones of the one or more layouts according to the one or more relations and/or the one or more orderings, and by means of the rendering elements; providing, for each of the zones, an output based on the rendering elements and the one or more relations and/or orderings; and collecting the output of each of the zones into a final output.Type: GrantFiled: November 21, 2002Date of Patent: December 21, 2010Assignee: Sitecore A/SInventors: Jakob H. Christensen, Ole S. Thrane
-
Patent number: 7849403Abstract: Techniques are disclosed for efficiently updating rendered content (such as content of a Web page) using a “slow-loading” content element, such as a slow-loading image. A reference is embedded within the markup language notation for the content to be rendered, where this reference identifies the source of the slow-loading content element. Delivery of the slow-loading content therefore begins automatically, when the content is rendered. Event handling attributes are specified with the reference, where values of these attributes identify client-side logic to be invoked when the associated event occurs. If the server determines that the rendered content, or some portion thereof, should be asynchronously updated, it abruptly terminates delivery of the slow-loading content. This termination triggers an event handler, which operates to automatically request reloading of the content.Type: GrantFiled: August 1, 2007Date of Patent: December 7, 2010Assignee: International Business Machines CorporationInventors: Niraj P. Joshi, Robert C. Leah, Paul F. McMahan
-
Patent number: 7836396Abstract: A style compressor for improving compression of a web document collects all the style attributes referenced by elements of the web document into a central style repository. The style compressor replaces each reference by each of the elements with a class attribute assigned in the central style repository to the replaced style attributes. Next, the style compressor organizes the style attributes in the central style repository and compresses the style attributes into a minimized selection of class attributes each referencing at least one of the style attributes. The style compressor updates the elements of the web document to reference one or more of the minimized selection of class attributes in the central style repository to produce a shorter, more efficiently organized web document.Type: GrantFiled: January 5, 2007Date of Patent: November 16, 2010Assignee: International Business Machines CorporationInventor: Daniel O. Becker
-
Patent number: 7823064Abstract: Various embodiments described herein provide systems, methods, software, and data structures that allow a page description language reader application, such as Adobe® Acrobat®, to provide a view of the assets compressed within a compressed archive. These and other embodiments including applications that may interleave one or more compressed archives within a page description language file, such as a PDF file. This interleaving leverages properties of both compressed archive files and page description language files to insert object markers around compressed assets within the compressed archives and references to at least some of the object markers in an index portion of the page description language files. As a result, compressed assets of compressed archives included in a page description language file package may be viewed within the listing of assets of the page description language file. These and other embodiments are described in greater detail below.Type: GrantFiled: August 30, 2007Date of Patent: October 26, 2010Assignee: Adobe Systems IncorporatedInventor: Philip Levy
-
Patent number: 7818664Abstract: The present invention is a computer software product and method including an XML parsing module, a translation module communicatively coupled to the XML parsing module and an output module communicatively coupled to the translation module, the output module configurable by the end user to execute a predefined event responsive to the processing of an associated XML tag. The predefined event may be selected from a group consisting of audio output modification, visual output modification, and tactile output modification. These output modifications may include, but are not limited to, voice inflection, voice gender, voice volume, voice pitch, voice speed, visual display modification, visual display cues, tactile device output modification and the like.Type: GrantFiled: March 16, 2005Date of Patent: October 19, 2010Assignee: Freedom Scientific, Inc.Inventors: Glen Gordon, Christian Hofstader, Joseph Stephen, Eric Damery
-
Patent number: 7810026Abstract: A method for optimizing a source document comprising a plurality of pages of content, comprising each of the following, is presented. A source document is obtained. An optimized document is created corresponding to the source document. Thereafter, for each page in the source document, the following are applied. A page record is created for the page. Each page record comprises a word table comprising a list of the page's words in the order that they appear in the page's content. Each page record further comprises a paragraph entry list for the page including a paragraph entry for each paragraph in the page. Each paragraph entry includes a reference to the first and last word of that paragraph in the word table. The page record is compressed using a compression technique. Thereafter, the compressed page record is stored in the optimized document.Type: GrantFiled: September 29, 2006Date of Patent: October 5, 2010Assignee: Amazon Technologies, Inc.Inventors: Joshua Shagam, Robert L Goodwin
-
Patent number: 7769904Abstract: An extensible binary mark-up language is disclosed that is compatible with existing XML standards yet provides significantly improved efficiencies for XML-based data storage and communications, particularly for narrow and low bandwidth communication media. A corresponding extensible non-binary mark-up language is also disclosed that is compatible with the XML standard. This dual-representation common message format (CMF) allows standard XML tools to be utilized in viewing and editing XML-based data and allows a CMF parser to be utilized to convert the XML formatted information into an extensible binary representation for actual communication through a medium or storage on a wide range of media.Type: GrantFiled: June 9, 2004Date of Patent: August 3, 2010Assignee: L-3 Communications Integrated Systems L.P.Inventor: Bill J. Eller
-
Patent number: 7742048Abstract: A method, system, and apparatus are provided for converting numbers to text and to digit shapes expressed in different languages. When a string of text is entered into an application program, the string is analyzed to determine whether the string of text includes a number expressed using digits. If the string of text includes a number expressed using digits, the number is semantically labeled with schema information. When a selection is received of the number, a list of actions may be provided identifying conversion actions available for the number. One of the conversion options may then be selected resulting in the conversion of the selected number to text in one or more languages. A conversion option may also be selected for converting the number digits to digits expressed using other digit shapes. The converted number may then be inserted into the string of text to replace the selected number.Type: GrantFiled: May 23, 2002Date of Patent: June 22, 2010Assignee: Microsoft CorporationInventors: Thomas G. Moore, Mohamed A. Abbar, Nader G. Issa
-
Patent number: 7743321Abstract: A method of processing data associated with processing options, wherein selected processing options select variants within the data and selected variants are activated, and variants associated with unselected options can be selected in a further iteration of the process.Type: GrantFiled: October 28, 2005Date of Patent: June 22, 2010Assignee: Hewlett-Packard Development Company, L.P.Inventor: Roger Brian Gimson
-
Patent number: 7721199Abstract: Two files are configured for more rapid comparison by removing blank spaces from the code of all of the lines in each of the files to be compared and also by attaching a control to the beginning of each line of code in each of the two sources prior to be compared. After blank spaces are removed and the controls attached, the two files are compared to determine additions, deletions and changes. The program continues to compare the strings of identical length until all of the strings have been compared. By comparing only strings of identical length, the program is able to compare two sources in substantially less time than by using conventional means of comparison.Type: GrantFiled: March 3, 2006Date of Patent: May 18, 2010Assignee: International Business Machines CorporationInventor: Charles Gordon Hooks
-
Patent number: 7721204Abstract: Methods and systems that dynamically control and/or direct document image processing based on one or more of client device display capabilites, image formats supported by the client device, network bandwidth, document file format, document content and user hints.Type: GrantFiled: July 29, 2004Date of Patent: May 18, 2010Assignee: Xerox CorporationInventors: Reiner Eschbach, Emil V. Rainero
-
Patent number: 7693848Abstract: A method and apparatus is provided for converting a document in a first format essentially comprising a flat layout structure into a structured document in a hierarchical form in accordance with predetermined attributes identified from the input format. The process comprises fragmenting the input document into a plurality of document content elements in accordance with a predetermined set of document attributes identifiable from the input document format. The content elements are clustered into selective sets having similar document attributes. The clustered sets are validated with reference to common textual properties organizational content common in documents in the collection. The clustered sets are then categorized into predetermined categories comprising structured elements of the structured document format and the document content elements are organized by hierarchical dependency from the predetermined categories wherein the organized document elements comprise the desired structured document format.Type: GrantFiled: January 10, 2005Date of Patent: April 6, 2010Assignee: Xerox CorporationInventors: Hervé Déjean, Veronika Lux, Sandrine Ribeau
-
Publication number: 20100083101Abstract: A structured document is coded as a bit stream. The structured document can be described in Extensible Markup Language (XML). The structured document can be decoded to access a subpart of the document. Hierarchized events included in the structured document are coded as corresponding events using at least one encoding table, which is updated during the coding. Hierarchized events involved in an update of the at least one encoding table are determined. Marking information is associated with the coded events corresponding to the hierarchized events determined to be involved in the update of the at least one encoding table.Type: ApplicationFiled: September 30, 2009Publication date: April 1, 2010Applicant: CANON KABUSHIKI KAISHAInventors: Franck Denoual, Herve Ruellan
-
Publication number: 20100060935Abstract: The image data processor generates storage layout information and output joint information, and then generates a page raster for storage based on the storage layout information and a page raster for output based on the output joint information. The storage layout information is based on layout information describing object layouts and includes a page formed by the same objects and reusable objects of the same arrangements and a page formed by each variable object. The output joint information combines a page for reusable objects and a page for the variable object.Type: ApplicationFiled: August 12, 2009Publication date: March 11, 2010Applicant: FUJIFILM CORPORATIONInventors: Mitsuru NAKAO, Nobuyuki SHITARA
-
Patent number: 7676742Abstract: A system and method for processing of markup language information, such as extensible markup language (XML) based control information is disclosed. The system comprises a compression sender entity and a decompression receiver entity. The sender entity uses its internal, binary representation derived from textual XML-based protocol header and encodes the binary representation as a tag-length-value (TLV) binary value. The TLV binary value is embedded as encoded data in a specifically marked comment, which may be detected by the receiver entity. The receiver entity builds up an internal, binary data structure based on the encoded data representing the original XML data. The system avoids the usage of string data to represent a small set of well known strings and uses a more compact binary encoding which reduces space and time to process the data.Type: GrantFiled: July 17, 2004Date of Patent: March 9, 2010Assignee: International Business Machines CorporationInventors: Carl Binding, Reto Hermann, Andreas Schade
-
Patent number: 7669120Abstract: A method and system for encoding a mark-up language document is provided, in which, the structure of the mark-up language document is condensed by removing those parts of the structure that are fixed, and by expressing the variable parts of the structure in terms of whether or how often they occur. A template that specifies which parts of the structure are fixed and which parts are variable may be provided to both the device that is transmitting the mark-up language document and the device that is receiving the mark-up language document.Type: GrantFiled: June 21, 2002Date of Patent: February 23, 2010Assignee: Microsoft CorporationInventors: Hongbin Liao, Qian Zhang, Wenwu Zhu
-
Patent number: 7653752Abstract: A script is scanned to detect character strings that match predetermined criteria. The detected character strings are sorted in order of their appearance frequency and are associated with substitute characters or character strings such that a character string having a greater appearance frequency than another character string is associated with substitute characters or character string having a same or smaller number of characters. The script is searched for a given character string, and the given character string is replaced with its associated substitute characters or character strings. The content is unidirectionally distributed whereby the plurality of character strings is not distributed.Type: GrantFiled: September 25, 2007Date of Patent: January 26, 2010Assignee: Sony CorporationInventor: Yoshiharu Dewa
-
Patent number: 7647552Abstract: Disclosed is a method (900) for communicating at least part of a structure of a document (104) described by a hierarchical representation (102). The method identifies (902) the hierarchical representation (eg. the tree structure) of the document (104). The identification is preferably performed using XML tags. The representation is then packetized (906) into a plurality of data packets. At least one link is then created (908) between a pair of the packets, the link acting to represent an interconnection between corresponding components (eg. structure and content) of the representation. The packets are then formed (910) into a stream for communication. The links maintain the hierachical representation within the packets.Type: GrantFiled: October 5, 2001Date of Patent: January 12, 2010Assignee: Canon Kabushiki KaishaInventor: Ernest Yiu Cheong Wan
-
Publication number: 20090327870Abstract: Provided are techniques for pipeline optimization based on polymorphic schema knowledge. A hierarchical document to be processed by a pipeline of transformations is received. It is determined whether a next downstream transformation accesses content of each schema node in an associated input schema, wherein the input schema is a polymorphic schema. In response to determining that the next downstream transformation is not accessing the content of each schema node in the associated input schema, data items corresponding to each schema node that the next downstream transformation is not accessing are collected into a single compressed event. The collected items are passed to the next downstream transformation as the single compressed event.Type: ApplicationFiled: June 26, 2008Publication date: December 31, 2009Applicant: International Business Machines CorporationInventors: Amir BAR-OR, Michael James BECKERLE
-
Patent number: 7640492Abstract: Methods and structures for parsing an Extensible Markup Language (XML) data stream are described. In the described embodiment, one or more schema modules are defined and are associated with HTTP requests that are received from a client. The schema module(s) has a function that determines whether an XML data stream conforms to a given schema that is associated with the HTTP request. If a portion of the XML data stream does not conform to the given schema, then the schema module disregards that XML data stream portion. In the described embodiment, each schema module has a plurality of states associated therewith. Each state is associated with one or more schema requirements that relate to a particular element that is evaluated by the schema module. Each state is different from the other states and the number of states is a function of the number of layers or elements that are contained within a particular XML data stream.Type: GrantFiled: July 6, 2005Date of Patent: December 29, 2009Assignee: Microsoft CorporationInventors: Joel M. Soderberg, Brian J. Deen