Patents by Inventor Samson J. Liu
Samson J. Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9330323
Abstract: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.
Type: Grant
Filed: April 29, 2012
Date of Patent: May 3, 2016
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Steven J Simske, Samson J. Liu
-
Patent number: 9218322
Abstract: A method for producing web page content includes identifying blocks within a web page. The blocks are selectively assembled into sections. The sections are selectively assembled into article candidates. An article candidate that includes article content is distinguished from article candidates that do not include article content. Content is produced only from the article candidate distinguished as including article content.
Type: Grant
Filed: July 28, 2010
Date of Patent: December 22, 2015
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Jian Fan, Ping Luo, Li-Wei Zheng, Samson J. Liu, Suk Hwan Lim, Jerry J. Liu, Yuhong Xiong
-
Patent number: 9098487
Abstract: Examples disclosed herein relate to categorizing a target word based on word distance. A processor may determine a difference level threshold for a category based on difference levels between words associated with the category and determine difference levels between a target word and the words associated with the category. If one of the difference levels of the target word is below the threshold associated with the category, the processor outputs the category.
Type: Grant
Filed: November 29, 2012
Date of Patent: August 4, 2015
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Giordano B Beretta, Samson J Liu, Steven J Simske
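The thresholding idea in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented method: the abstract does not say how a "difference level" is measured or how the threshold is derived, so the sketch assumes Levenshtein edit distance as the difference level and the largest pairwise distance within a category as its threshold. The function names are hypothetical.

```python
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, used here as an assumed 'difference level'."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def category_threshold(words):
    """Assumed rule: the largest pairwise difference among the category's words."""
    return max(edit_distance(a, b) for a, b in combinations(words, 2))


def categorize(target, categories):
    """Return each category that has at least one word whose difference
    from the target is below that category's threshold."""
    matches = []
    for name, words in categories.items():
        threshold = category_threshold(words)
        if any(edit_distance(target, w) < threshold for w in words):
            matches.append(name)
    return matches
```

With a hypothetical category such as {"colors": ["red", "reed", "rod"]}, calling categorize("rad", ...) would return the category because "rad" is one edit away from "red", while an unrelated word would return an empty list.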
-
Publication number: 20150138605
Abstract: Systems, devices and methods are provided which relate to detecting a print command on a client computer, the print command reflecting an interest to print content of an electronic document, accessible by a client computer, as a hard copy printout. One method includes analyzing the electronic document content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a new, printable document that includes the electronic document content and the identified commercial content.
Type: Application
Filed: September 21, 2010
Publication date: May 21, 2015
Inventors: Samson J. Liu, Parag M. Joshi, Sheng-Wen Yang, Jian-Ming Jin
-
Publication number: 20150049949
Abstract: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.
Type: Application
Filed: April 29, 2012
Publication date: February 19, 2015
Inventors: Steven J Simske, Samson J. Liu
-
Patent number: 8918403
Abstract: Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
Type: Grant
Filed: April 19, 2010
Date of Patent: December 23, 2014
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Samson J. Liu, Suk Hwan Lim, Jian-Ming Jin, Yuhong Xiong, Parag M. Joshi, Nina Bhatti, Jerry J. Liu, Jian Fan, Sheng-Wen Yang
-
Patent number: 8819028
Abstract: A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
Type: Grant
Filed: December 14, 2009
Date of Patent: August 26, 2014
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Ping Luo, Jian Fan, Samson J. Liu, Yuhong Xiong, Jerry J. Liu
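The "maximum scoring subsequence" step named in this abstract is commonly computed with a single linear pass (Kadane's algorithm). The Python sketch below illustrates that general technique only; how the patent scores each paragraph is not stated in the abstract, so the per-paragraph scores here are assumed inputs (for example, positive for paragraphs that look like body text and negative for navigation or advertising), and the function name is hypothetical.

```python
def max_scoring_subsequence(scores):
    """Return (start, end, total) for the contiguous run of paragraphs
    with the highest total score; end is exclusive.

    Returns (0, 0, 0.0) when no run has a total above zero.
    """
    best_start, best_end, best_total = 0, 0, 0.0
    run_start, run_total = 0, 0.0
    for i, score in enumerate(scores):
        # A run whose total has dropped to zero or below cannot help
        # any later run, so start a fresh run at this paragraph.
        if run_total <= 0:
            run_start, run_total = i, 0.0
        run_total += score
        if run_total > best_total:
            best_start, best_end, best_total = run_start, i + 1, run_total
    return best_start, best_end, best_total
```

The returned index range would mark the candidate text body; paragraphs outside it would be treated as surrounding page furniture before any further refinement.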
-
Publication number: 20140149106
Abstract: Examples disclosed herein relate to categorizing a target word based on word distance. A processor may determine a difference level threshold for a category based on difference levels between words associated with the category and determine difference levels between a target word and the words associated with the category. If one of the difference levels of the target word is below the threshold associated with the category, the processor outputs the category.
Type: Application
Filed: November 29, 2012
Publication date: May 29, 2014
Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
Inventors: Giordano B Beretta, Samson J. Liu, Steven J. Simske
-
Patent number: 8577887
Abstract: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.
Type: Grant
Filed: December 16, 2009
Date of Patent: November 5, 2013
Assignee: Hewlett-Packard Development Company, L.P.
Inventors: Parag M. Joshi, Jian-Ming Jin, Sheng-Wen Yang, Samson J. Liu, Nina Bhatti, Suk Hwan Lim
-
Publication number: 20130124953
Abstract: A method for producing web page content includes identifying blocks within a web page. The blocks are selectively assembled into sections. The sections are selectively assembled into article candidates. An article candidate that includes article content is distinguished from article candidates that do not include article content. Content is produced only from the article candidate distinguished as including article content.
Type: Application
Filed: July 28, 2010
Publication date: May 16, 2013
Inventors: Jian Fan, Ping Luo, Li-Wei Zheng, Samson J. Liu, Suk Hwan Lim, Jerry J. Liu, Yuhong Xiong
-
Publication number: 20130114105
Abstract: Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
Type: Application
Filed: April 19, 2010
Publication date: May 9, 2013
Inventors: Samson J. Liu, Suk Hwan Lim, Jian-Ming Jin, Yuhong Xiong, Parag M. Joshi, Nina Bhatti, Jerry J. Liu, Jian Fan, Sheng-Wen Yang
-
Publication number: 20120303636
Abstract: A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
Type: Application
Filed: December 14, 2009
Publication date: November 29, 2012
Inventors: Ping Luo, Jian Fan, Samson J. Liu, Yuhong Xiong, Jerry J. Liu
-
Publication number: 20120246552
Abstract: Examples disclosed herein are example systems and methods to provide a particular type of uniform resource locator. In one example, a processor identifies webpage source code associated with a list of text associated with the type of uniform resource locator. The processor may identify a uniform resource locator within the identified webpage source code and provide the uniform resource locator.
Type: Application
Filed: March 21, 2011
Publication date: September 27, 2012
Inventors: Samson J. Liu, Suk Hwan Lim, Jerry J. Liu
-
Publication number: 20120150637
Abstract: In one embodiment, a system and method relate to detecting a print command received by a network browser of a client computer, the print command reflecting an interest to print content of a network page displayed in the network browser as a hard copy printout, analyzing the network page content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a document that includes the network page content and the identified commercial content.
Type: Application
Filed: August 26, 2009
Publication date: June 14, 2012
Inventors: Samson J. Liu, Parag M. Joshi
-
Publication number: 20110145249
Abstract: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.
Type: Application
Filed: December 16, 2009
Publication date: June 16, 2011
Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
Inventors: Parag M. Joshi, Jian-Ming Jin, Sheng-Wen Yang, Samson J. Liu, Nina Bhatti, Suk Hwan Lim
-
Publication number: 20040146211
Abstract: An apparatus may include elements to send a plurality of frame rate indications associated with a frame included in a first stream segment, to receive the plurality of frame rate indications, and to generate a non-predicted frame associated with a frame included in a second stream segment. A method may include reducing a time difference between a first stream segment and a second stream segment and combining the first stream segment and the second stream segment to provide a single stream. In addition, the method may include encoding a first frame included in the single stream according to an associated first frame rate and encoding a second frame included in the single stream according to an associated second frame rate, wherein the second frame rate is different than the first frame rate.
Type: Application
Filed: January 29, 2003
Publication date: July 29, 2004
Inventors: Verna E. Knapp, Samson J. Liu
-
Patent number: 6011868
Abstract: A bitstream quality analysis system includes a demultiplexer, a bitstream quality analyzer, and a graphical user interface. The demultiplexer receives a bitstream and separates from the bitstream at least one elementary bitstream that includes a video elementary bitstream. The bitstream quality analyzer receives the video elementary bitstream from the demultiplexer and parses the video elementary bitstream to extract parameters characterizing the video elementary bitstream. The bitstream quality analyzer provides the extracted parameters to the graphical user interface, which displays the extracted parameters characterizing the video elementary bitstream. A user can monitor the video elementary bitstream at varying levels of detail, and can perform varying levels of quality analysis on the video elementary bitstream.
Type: Grant
Filed: April 4, 1997
Date of Patent: January 4, 2000
Assignee: Hewlett-Packard Company
Inventors: Christian J. van den Branden, Chong T. Ong, Samson J. Liu, Mark A. Leonard
-
Patent number: 5880767
Abstract: The present invention provides an effective, low cost method and system for enhancing various types of images including photograph, CD, video, and graphic art images. The method of enhancing the input image, includes the steps of: filtering the input image to extract m different frequency components r_k; adaptively sharpening the m different frequency components r_k, where the amount of sharpening for each component r_k corresponds to a sharpening function g_k[r_k]; and adding the adaptively sharpened m different frequency components g_k[r_k] to the input image. Because the sharpening function is typically nonlinear, the step of determining the value of the adaptive frequency component corresponding to the sharpening function is achieved by mapping the filtered component by the corresponding sharpening function.
Type: Grant
Filed: September 11, 1996
Date of Patent: March 9, 1999
Assignee: Hewlett-Packard Company
Inventor: Samson J. Liu
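The three steps in this abstract (filter into m components r_k, map each through a nonlinear g_k, add the results back to the input) can be sketched on a one-dimensional signal in plain Python. Everything specific here is an assumption for illustration: the abstract does not name the filters or the shape of g_k, so the sketch uses differences of box blurs as the frequency components and a thresholded gain as the sharpening function. All function names and default parameters are hypothetical.

```python
def box_blur(signal, radius):
    """Moving average with edge clamping; a simple low-pass filter."""
    n = len(signal)
    out = []
    for i in range(n):
        window = [signal[min(max(j, 0), n - 1)]
                  for j in range(i - radius, i + radius + 1)]
        out.append(sum(window) / len(window))
    return out


def band_components(signal, radii):
    """Extract len(radii) frequency components r_k, each the difference
    between two successive blur levels (assumed filter bank)."""
    levels = [list(signal)] + [box_blur(signal, r) for r in radii]
    return [[a - b for a, b in zip(levels[k], levels[k + 1])]
            for k in range(len(radii))]


def sharpen_fn(value, gain, threshold):
    """Assumed nonlinear g_k: drop small responses, amplify the rest."""
    return gain * value if abs(value) > threshold else 0.0


def enhance(signal, radii=(1, 2), gains=(1.0, 0.5), threshold=0.05):
    """Add the mapped components g_k[r_k] back onto the input signal."""
    out = list(signal)
    for r_k, gain in zip(band_components(signal, radii), gains):
        for i, value in enumerate(r_k):
            out[i] += sharpen_fn(value, gain, threshold)
    return out
```

On a flat signal the components are all zero and the output equals the input; on a step edge the output undershoots just before the edge and overshoots just after it, which is the usual visual signature of sharpening.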