Patents by Inventor Xiaofan Lin
Xiaofan Lin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12106333Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.Type: GrantFiled: May 23, 2023Date of Patent: October 1, 2024Assignee: A9.com, Inc.Inventors: Douglas Ryan Gray, Arnab Sanat Kumar Dhua, Xiaofan Lin, Zhijiang Mark Lu
-
Patent number: 11960528Abstract: When a first search query including an image of an item is received to search for items associated with similar images, a second search query that includes text based on the image is generated. The text may be based on previous queries associated with the depicted item, visual features of the image, or text that is present in the image. The results from the first search query are scored based on their correspondence with the image of the item. Results having a score greater than a threshold are presented first in the output, followed by a selected number of results from the second search query. Results from the first search query that are associated with a score less than the threshold may be presented after the results from the second search query. This presentation increases the likelihood that items presented earlier in the output are relevant to the initial query.Type: GrantFiled: September 30, 2022Date of Patent: April 16, 2024Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Aditya Deorha, Xiaofan Lin, Shashank Shekhar
-
Publication number: 20230298073Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.Type: ApplicationFiled: May 23, 2023Publication date: September 21, 2023Inventors: Douglas Ryan Gray, Arnab Sanat Kumar Dhua, Xiaofan Lin, Zhijiang Mark Lu
-
Patent number: 11699174Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.Type: GrantFiled: February 8, 2021Date of Patent: July 11, 2023Assignee: A9.com, Inc.Inventors: Douglas Ryan Gray, Arnab Sanat Kumar Dhua, Xiaofan Lin, Zhijiang Mark Lu
-
Patent number: 11227326Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.Type: GrantFiled: November 4, 2019Date of Patent: January 18, 2022Assignee: A9.com, Inc.Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
-
Publication number: 20210174401Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.Type: ApplicationFiled: February 8, 2021Publication date: June 10, 2021Inventors: Douglas Ryan Gray, Arnab Sanat Kumar Dhua, Xiaofan Lin, Zhijiang Mark Lu
-
Patent number: 10963924Abstract: A computing device can obtain data describing at least one document, the at least one document referencing at least one media object, wherein a portion of the at least one media object includes one or more characters. The computing device can obtain data describing the one or more characters in the at least one media object in the at least one document. The computing device can generate an updated copy of the at least one document that includes the data describing the one or more characters in the at least one media object. The computing device can present, on a display screen of the computing device and through an interface, the updated copy of the at least one document, wherein the one or more characters in the at least one media object are able to be selected or searched.Type: GrantFiled: March 10, 2014Date of Patent: March 30, 2021Assignee: A9.com, Inc.Inventors: Douglas Ryan Gray, Arnab Sanat Kumar Dhua, Xiaofan Lin, Zhijiang Mark Lu
-
Patent number: 10769200Abstract: A user can capture an image of a text object of interest and have that image submitted for processing. The image can be pre-processed to improve quality and then submitted to an optical character recognition process to identify the words, characters, or strings in the image. At least some of these results can be submitted as a query to a search engine to obtain potential matches. In order to improve the accuracy of the results, information such as the titles for the results can be compared against each recognized word, character, or string from the image, including the ordering of those elements. An updated relevancy score can then be generated based on the full, ordered set. The recognized text is also analyzed to attempt to recognize model numbers or other identifiers that can be weighted more heavily as being indicative of accurate matches. Matches are selected from the re-ranked results.Type: GrantFiled: July 1, 2015Date of Patent: September 8, 2020Assignee: A9.COM, INC.Inventor: Xiaofan Lin
-
Publication number: 20200143245Abstract: An aircraft includes a propulsion system, a sensor system, a control system, and a processing system including a memory and a processor. The memory is configured to store computer-executable instructions. The processor is configured to access the memory and to execute the computer-executable instructions to perform the following steps: obtaining a set of first weights of a processing unit of a neural network; ternarizing each weight included in the set of first weights to obtain a set of second weights; generating an output of the processing unit based on the set of second weights and a set of inputs of the processing unit; and training weights included in the set of first weights of the processing unit of the neural network based on an error cost function including an error term and a structurally sparse term.Type: ApplicationFiled: November 27, 2019Publication date: May 7, 2020Inventors: Wei PAN, Jian CUI, Xiaofan LIN, Cong ZHAO
-
Publication number: 20200068132Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.Type: ApplicationFiled: November 4, 2019Publication date: February 27, 2020Inventors: XIAOFAN LIN, ARNAB SANAT KUMAR DHUA, DOUGLAS RYAN GRAY, ATUL KUMAR, YU LOU
-
Patent number: 10540378Abstract: Approaches provide for analyzing image data to determine and/or recognize text in the image data. The recognized text can be used to generate a search query that can be automatically submitted to a search engine without having to type the search query to identify a product (or related products) associated with the image. For example, a camera of a computing device can be used to capture a live camera view (or single images) an item. An application executing on the computing device (or at least in communication with the computing device) can analyze the image data of the live camera view to determine a set of keywords (e.g., identified text) based on visual features extracted from the image data. The keywords can be used to query an index of product titles, common search queries, among other indexed data to return a ranked list of search suggestions based on a relevance function.Type: GrantFiled: June 28, 2016Date of Patent: January 21, 2020Assignee: A9.com, Inc.Inventors: Edward Hsiao, Douglas Ryan Gray, Nityananda Jayadevaprakash, Xiaofan Lin, Mark Jay Nitzberg, Shruti Sheorey
-
Patent number: 10506168Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.Type: GrantFiled: April 5, 2017Date of Patent: December 10, 2019Assignee: A9.COM, INC.Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
-
Patent number: 10445569Abstract: Approaches provide for recognizing and locating text represented in image data. For example, image data that includes representations of text can be obtained. A width-focused recognition engine can be configured to analyze the image data to determine a base-set of words. The base-set of words can be associated with logical structure information that describes a geometric relationship between words in the base-set of words. A set of bounding boxes that includes one or more base words can be determined, as well as a confidence value for each base word. A depth-focused recognition engine can be configured to analyze the image data to determine a focused-set of words, the focused-set of words associated with a set of bounding boxes and confidence values for respective words. A set of merged words can be determined from a set of overlapping bounding boxes that overlap a threshold amount.Type: GrantFiled: August 30, 2016Date of Patent: October 15, 2019Assignee: A9.COM, INC.Inventors: Xiaofan Lin, Son Dinh Tran
-
Patent number: 10140549Abstract: Various embodiments may increase scalability of image representations stored in a database for use in image matching and retrieval. For example, a system providing image matching can obtain images of a number of inventory items, extract features from each image using a feature extraction algorithm, and transform the same into their feature descriptor representations. These feature descriptor representations can be subsequently stored and used to compare against query images submitted by users. Though the size of each feature descriptor representation isn't particularly large, the total number of these descriptors requires a substantial amount of storage space. Accordingly, feature descriptor representations are compressed to minimize storage and, in one example, machine learning can be used to compensate for information lost as a result of the compression.Type: GrantFiled: February 27, 2017Date of Patent: November 27, 2018Assignee: A9.COM, INC.Inventors: Simant Dube, Sunil Ramesh, Xiaofan Lin, Arnab Sanat Kumar Dhua, Colin Jon Taylor, Jaishanker K. Pillai
-
Patent number: 10121229Abstract: Systems and approaches are provided for optimizing self-portraiture. The background of the self-portrait can be enhanced by image registration or stitching techniques of images captured using one or more conventional cameras. Multiple standard resolution images can be stitched together to generate a panoramic or a composite image of a higher resolution. Foreground elements, such as one or more representations of users, can also be enhanced in various ways. The representations of the users can be composited to exclude undesirable elements, such as image data of one of the users extending her arm to capture the self-portrait. An ideal pose of the users can automatically be selected and other image enhancements, such as histogram optimization, brightness and contrast optimization, color-cast correction, or reduction or removal of noise, can automatically be performed to minimize user effort in capturing self-portraits.Type: GrantFiled: December 30, 2015Date of Patent: November 6, 2018Assignee: A9.com, Inc.Inventors: Douglas Ryan Gray, Colin Jon Taylor, Xiaofan Lin
-
Patent number: 10038839Abstract: Various approaches provide for detecting and recognizing text to enable a user to perform various functions or tasks. For example, a user could point a camera at an object with text, in order to capture an image of that object. The camera can be integrated with a portable computing device that is capable of taking the image and processing the image (or providing the image for processing) to recognize, identify, and/or isolate the text in order to send the image of the object as well as recognized text to an application, function, or system, such as an electronic marketplace.Type: GrantFiled: June 1, 2017Date of Patent: July 31, 2018Assignee: A.9.com, Inc.Inventors: Adam Wiggen Kraft, Kathy Wing Lam Ma, Xiaofan Lin, Arnab Sanat Kumar Dhua, Yu Lou
-
Patent number: 10013624Abstract: Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.Type: GrantFiled: December 16, 2015Date of Patent: July 3, 2018Assignee: A9.com, Inc.Inventors: Douglas Ryan Gray, Xiaofan Lin, Arnab Sanat Kumar Dhua, Yu Lou
-
Patent number: 9934526Abstract: Various embodiments enable a process to automatically attempt to select the most relevant words associated with products available for purchase from an electronic marketplace from an image frame. For example, an image frame containing text can be obtained and analyzed with an optical character recognition. The recognized words can then be preprocessed using various filtering and scoring techniques to narrow down a volume of text to a few relevant query terms. These query terms can then be sent to a search engine associated with the electronic marketplace to return relevant products to a user.Type: GrantFiled: June 27, 2013Date of Patent: April 3, 2018Assignee: A9.com, INC.Inventors: Arnab Sunat Kumar Dhua, Douglas Ryan Gray, Xiaofan Lin, Yu Lou, Adam Wiggen Kraft, Sunil Ramesh
-
Patent number: 9870633Abstract: Various embodiments enable a computing device to perform tasks such as highlighting words in an augmented reality view that are important to a user. For example, word lists can be generated and the user, by pointing a camera of a computing device at a volume of text, can cause words from the word list within the volume of text to be highlighted in a live field of view of the camera displayed thereon. Accordingly, users can quickly identify textual information that is meaningful to them in an Augmented Reality view to aid the user in sifting through real-world text.Type: GrantFiled: December 21, 2016Date of Patent: January 16, 2018Assignee: A9.COM, INC.Inventors: Adam Wiggen Kraft, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Xiaofan Lin, Yu Lou, Sunil Ramesh, Colin Jon Taylor, David Creighton Mott
-
Publication number: 20170272648Abstract: Various approaches provide for detecting and recognizing text to enable a user to perform various functions or tasks. For example, a user could point a camera at an object with text, in order to capture an image of that object. The camera can be integrated with a portable computing device that is capable of taking the image and processing the image (or providing the image for processing) to recognize, identify, and/or isolate the text in order to send the image of the object as well as recognized text to an application, function, or system, such as an electronic marketplace.Type: ApplicationFiled: June 1, 2017Publication date: September 21, 2017Inventors: ADAM WIGGEN KRAFT, KATHY WING LAM MA, XIAOFAN LIN, ARNAB SANAT KUMAR DHUA, YU LOU