Patents by Inventor Samson J. Liu

Samson J. Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9330323
    Abstract: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.
    Type: Grant
    Filed: April 29, 2012
    Date of Patent: May 3, 2016
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Steven J Simske, Samson J. Liu
  • Patent number: 9218322
    Abstract: A method for producing web page content includes identifying blocks within a web page. The blocks are selectively assembled into sections. The sections are selectively assembled into article candidates. An article candidate that includes article content is distinguished from article candidates that do not include article content. Content is produced only from the article candidate distinguished as including article content.
    Type: Grant
    Filed: July 28, 2010
    Date of Patent: December 22, 2015
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Jian Fan, Ping Luo, Li-Wei Zheng, Samson J. Liu, Suk Hwan Lim, Jerry J. Liu, Yuhong Xiong
  • Patent number: 9098487
    Abstract: Examples disclosed herein relate to categorizing a target word based on word distance. A processor may determine a difference level threshold for a category based on difference levels between words associated with the category and determine difference levels between a target word and the words associated with the category. If one of the difference levels of the target word is below the threshold associated with the category, the processor outputs the category.
    Type: Grant
    Filed: November 29, 2012
    Date of Patent: August 4, 2015
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Giordano B Beretta, Samson J Liu, Steven J Simske
  • Publication number: 20150138605
    Abstract: Systems, devices and methods are provided which relate to detecting a print command on a client computer, the print command reflecting an interest to print content of an electronic document, accessible by a client computer, as a hard copy printout. One method includes analyzing the electronic document content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a new, printable document that includes the electronic document content and the identified commercial content.
    Type: Application
    Filed: September 21, 2010
    Publication date: May 21, 2015
    Inventors: Samson J. Liu, Parag M. Joshi, Sheng-Wen Yang, Jian-Ming Jin
  • Publication number: 20150049949
    Abstract: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.
    Type: Application
    Filed: April 29, 2012
    Publication date: February 19, 2015
    Inventors: Steven J Simske, Samson J. Liu
  • Patent number: 8918403
    Abstract: Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
    Type: Grant
    Filed: April 19, 2010
    Date of Patent: December 23, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Samson J. Liu, Suk Hwan Lim, Jian-Ming Jin, Yuhong Xiong, Parag M. Joshi, Nina Bhatti, Jerry J. Liu, Jian Fan, Sheng-Wen Yang
  • Patent number: 8819028
    Abstract: A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
    Type: Grant
    Filed: December 14, 2009
    Date of Patent: August 26, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Ping Luo, Jian Fan, Samson J. Liu, Yuhong Xiong, Jerry J. Liu
  • Publication number: 20140149106
    Abstract: Examples disclosed herein relate to categorizing a target word based on word distance. A processor may determine a difference level threshold for a category based on difference levels between words associated with the category and determine difference levels between a target word and the words associated with the category. If one of the difference levels of the target word is below the threshold associated with the category, the processor outputs the category.
    Type: Application
    Filed: November 29, 2012
    Publication date: May 29, 2014
    Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P
    Inventors: Giordano B Beretta, Samson J. Liu, Steven J. Simske
  • Patent number: 8577887
    Abstract: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.
    Type: Grant
    Filed: December 16, 2009
    Date of Patent: November 5, 2013
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Parag M. Joshi, Jian-Ming Jin, Sheng-Wen Yang, Samson J. Liu, Nina Bhatti, Suk Hwan Lim
  • Publication number: 20130124953
    Abstract: A method for producing web page content includes identifying blocks within a web page. The blocks are selectively assembled into sections. The sections are selectively assembled into article candidates. An article candidate that includes article content is distinguished from article candidates that do not include article content. Content is produced only from the article candidate distinguished as including article content.
    Type: Application
    Filed: July 28, 2010
    Publication date: May 16, 2013
    Inventors: Jian Fan, Ping Luo, Li-Wei Zheng, Samson J. Liu, Suk Hwan Lim, Jerry J. Liu, Yuhong Xiong
  • Publication number: 20130114105
    Abstract: Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
    Type: Application
    Filed: April 19, 2010
    Publication date: May 9, 2013
    Inventors: Samson J. Liu, Suk Hwan Lim, Jian-Ming Jin, Yuhong Xiong, Parag M. Joshi, Nina Bhatti, Jerry J. Liu, Jian Fan, Sheng-Wen Yang
  • Publication number: 20120303636
    Abstract: A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
    Type: Application
    Filed: December 14, 2009
    Publication date: November 29, 2012
    Inventors: Ping Luo, Jian Fan, Samson J. Liu, Yuhong Xiong, Jerry J. Liu
  • Publication number: 20120246552
    Abstract: Examples disclosed herein are example systems and methods to provide a particular type of uniform resource locator. In one example, a processor identifies webpage source code associated with a list of text associated with the type of uniform resource locator. The processor may identify a uniform resource locator within the identified webpage source code and provide the uniform resource locator.
    Type: Application
    Filed: March 21, 2011
    Publication date: September 27, 2012
    Inventors: Samson J. Liu, Suk Hwan Lim, Jerry J. Liu
  • Publication number: 20120150637
    Abstract: In one embodiment, a system and method relate to detecting a print command received by a network browser of a client computer, the print command reflecting an interest to print content of a network page displayed in the network browser as a hard copy printout, analyzing the network page content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a document that includes the network page content and the identified commercial content.
    Type: Application
    Filed: August 26, 2009
    Publication date: June 14, 2012
    Inventors: Samson J. Liu, Parag M. Joshi
  • Publication number: 20110145249
    Abstract: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.
    Type: Application
    Filed: December 16, 2009
    Publication date: June 16, 2011
    Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
    Inventors: Parag M. Joshi, Jian-Ming Jin, Sheng-Wen Yang, Samson J. Liu, Nina Bhatti, Suk Hwan Lim
  • Publication number: 20040146211
    Abstract: An apparatus may include elements to send a plurality of frame rate indications associated with a frame included in a first stream segment, to receive the plurality of frame rate indications, and to generate a non-predicted frame associated with a frame included in a second stream segment. A method may include reducing a time difference between a first stream segment and a second stream segment and combining the first stream segment and the second stream segment to provide a single stream. In addition, the method may include encoding a first frame included in the single stream according to an associated first frame rate and encoding a second frame included in the single stream according to an associated second frame rate, wherein the second frame rate is different than the first frame rate.
    Type: Application
    Filed: January 29, 2003
    Publication date: July 29, 2004
    Inventors: Verna E. Knapp, Samson J. Liu
  • Patent number: 6011868
    Abstract: A bitstream quality analysis system includes a demultiplexer, a bitstream quality analyzer, and a graphical user interface. The demultiplexer receives a bitstream and separates from the bitstream at least one elementary bitstream that includes a video elementary bitstream. The bitstream quality analyzer receives the video elementary bitstream from the demultiplexer and parses the video elementary bitstream to extract parameters characterizing the video elementary bitstream. The bitstream quality analyzer provides the extracted parameters to the graphical user interface, which displays the extracted parameters characterizing the video elementary bitstream. A user can monitor the video elementary bitstream at varying levels of detail, and can perform varying levels of quality analysis on the video elementary bitstream.
    Type: Grant
    Filed: April 4, 1997
    Date of Patent: January 4, 2000
    Assignee: Hewlett-Packard Company
    Inventors: Christian J. van den Branden, Chong T. Ong, Samson J. Liu, Mark A. Leonard
  • Patent number: 5880767
    Abstract: The present invention provides an effective, low cost method and system for enhancing various types of images including photograph, CD, video, and graphic art images. The method of enhancing the input image, includes the steps of: filtering the input image to extract m different frequency components r.sub.k ; adaptively sharpening the m different frequency components r.sub.k, where the amount of sharpening for each component r.sub.k corresponds to a sharpening function g.sub.k ?r.sub.k !; and adding the adaptively sharpened m different frequency components g.sub.k ?r.sub.k ! to the input image. Because the sharpening function is typically nonlinear, the step of determining the value of the adaptive frequency component corresponding to the sharpening function is achieved by mapping the filtered component by the corresponding sharpening function.
    Type: Grant
    Filed: September 11, 1996
    Date of Patent: March 9, 1999
    Assignee: Hewlett-Packard Company
    Inventor: Samson J. Liu