Patents by Inventor Kun Wu Huang

Kun Wu Huang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11763102
    Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.
    Type: Grant
    Filed: February 26, 2021
    Date of Patent: September 19, 2023
    Assignee: EMC IP Holding Company, LLC
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Patent number: 11429648
    Abstract: Embodiments of the present disclosure generally relate to a method and device for creating an index. For example, the embodiments of the present disclosure propose a method for creating an index, comprising: dividing a document into a plurality of regions; determining the number of times that a token appears in the plurality of regions, the token including at least one character in the document; assigning respective weights to the plurality of regions; and creating an inverted document linked list directed to the token based on the number of times that the token appears in the plurality of regions and respective weights of the plurality of regions. In addition, the embodiments of the present disclosure propose a corresponding device and computer program product for creating an index.
    Type: Grant
    Filed: April 28, 2020
    Date of Patent: August 30, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Winston Lei Zhang, Charlie Chen, Kun Wu Huang, Jingjing Liu, Duke Dai
  • Patent number: 11068536
    Abstract: Embodiments of the present disclosure relate to a method and apparatus for managing a document index. The method comprises determining an independently updatable field in a plurality of documents, the independently updatable field comprising at least one item. The method further comprises creating an index for an item in the independently updatable field, the index containing an identifier of a document comprising the item, the document being included in the plurality of documents. Furthermore, the method further comprises storing the identifier of the document in blocks such that the index is updatable without modifying the identifier of the document.
    Type: Grant
    Filed: June 22, 2017
    Date of Patent: July 20, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Patent number: 11048763
    Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: June 29, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
  • Publication number: 20210182506
    Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.
    Type: Application
    Filed: February 26, 2021
    Publication date: June 17, 2021
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Patent number: 10936667
    Abstract: Embodiments of the present disclosure provide a solution for indicating a search result. A method of indicating a search result is disclosed, which includes, in response to receiving a query term, searching for an electronic document having metadata related to the query term. The method further includes, in response to the electronic document being searched, locating a metadata term matching with the query term from the metadata of the electronic document. The method further includes providing an indication highlighting the metadata term.
    Type: Grant
    Filed: September 21, 2017
    Date of Patent: March 2, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
  • Patent number: 10936829
    Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.
    Type: Grant
    Filed: June 21, 2017
    Date of Patent: March 2, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Publication number: 20200257710
    Abstract: Embodiments of the present disclosure generally relate to a method and device for creating an index. For example, the embodiments of the present disclosure propose a method for creating an index, comprising: dividing a document into a plurality of regions; determining the number of times that a token appears in the plurality of regions, the token including at least one character in the document; assigning respective weights to the plurality of regions; and creating an inverted document linked list directed to the token based on the number of times that the token appears in the plurality of regions and respective weights of the plurality of regions. In addition, the embodiments of the present disclosure propose a corresponding device and computer program product for creating an index.
    Type: Application
    Filed: April 28, 2020
    Publication date: August 13, 2020
    Inventors: Winston Lei Zhang, Charlie Chen, Kun Wu Huang, Jingjing Liu, Duke Dai
  • Publication number: 20200133981
    Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.
    Type: Application
    Filed: December 31, 2019
    Publication date: April 30, 2020
    Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
  • Patent number: 10546024
    Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.
    Type: Grant
    Filed: March 20, 2017
    Date of Patent: January 28, 2020
    Assignee: EMC IP Holding Company LLC
    Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
  • Patent number: 10331717
    Abstract: Embodiments of the present disclosure provide a method and an apparatus for determining a similar document set to a target document from a plurality of documents. Each of the multiple documents and the target document may include a plurality of words, and each of words corresponds to a different integer. The method comprises: for each document among the plurality of documents and the target document, obtaining a set of integers associated with a document based on a set of words associated with the document, converting the set of integers associated with the document into a vector with a same dimension based on a predefined conversion rule; and determining the similar document set based on differences between the corresponding vectors for the multiple documents and the vector for the target document.
    Type: Grant
    Filed: December 20, 2016
    Date of Patent: June 25, 2019
    Assignee: EMC IP Holding Company LLC
    Inventors: Sean Kun Zhao, Chao Chen, Winston Lei Zhang, Jingjing Liu, Kun Wu Huang
  • Publication number: 20180089329
    Abstract: Embodiments of the present disclosure provide a method and device for managing index. For example, there is provided a method, comprising: obtaining a first index term in a first index, the first index term corresponding to a first index content in the first index, the first index content indicating a position of the first index term in a document; generating a reading of the first index term; and adding the reading as a second index term into a second index, the reading corresponding to a second index content indicating the first index term. Corresponding device and computer program product are also provided.
    Type: Application
    Filed: September 21, 2017
    Publication date: March 29, 2018
    Inventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
  • Publication number: 20180089335
    Abstract: Embodiments of the present disclosure provide a solution for indicating a search result. A method of indicating a search result is disclosed, which includes, in response to receiving a query term, searching for an electronic document having metadata related to the query term. The method further includes, in response to the electronic document being searched, locating a metadata term matching with the query term from the metadata of the electronic document. The method further includes providing an indication highlighting the metadata term.
    Type: Application
    Filed: September 21, 2017
    Publication date: March 29, 2018
    Inventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
  • Publication number: 20170371978
    Abstract: Embodiments of the present disclosure relate to a method and apparatus for managing a document index. The method comprises determining an independently updatable field in a plurality of documents, the independently updatable field comprising at least one item. The method further comprises creating an index for an item in the independently updatable field, the index containing an identifier of a document comprising the item, the document being included in the plurality of documents. Furthermore, the method further comprises storing the identifier of the document in blocks such that the index is updatable without modifying the identifier of the document.
    Type: Application
    Filed: June 22, 2017
    Publication date: December 28, 2017
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Publication number: 20170364510
    Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.
    Type: Application
    Filed: June 21, 2017
    Publication date: December 21, 2017
    Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
  • Publication number: 20170270114
    Abstract: Embodiments of the present disclosure provide a method and device for searching a character string. In one embodiment, a method of searching a character string is provided. The method comprises: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string. A corresponding device and a computer program product are also disclosed.
    Type: Application
    Filed: March 20, 2017
    Publication date: September 21, 2017
    Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
  • Publication number: 20170270184
    Abstract: Embodiments of the present disclosure provide a method and device for processing objects to be searched. The method comprises: receiviug a first input indicating a constraint associated with an object; receiving a second input indicating a category to which the object belong; and establishing, based on the first input and the second input, a classification condition associating the constraint with the category as a part of a classification policy which is used for classifying the object into a category to create a search index. In addition, embodiments of the present disclosure further disclose a method and device for creating a search index for an object to be searched.
    Type: Application
    Filed: March 17, 2017
    Publication date: September 21, 2017
    Inventors: Kun Wu Huang, Chao Chen, Winston Lei Zhang, Jingjing Liu, Duke Hongtao Dai
  • Publication number: 20170185671
    Abstract: Embodiments of the present disclosure provide a method and an apparatus for determining a similar document set to a target document from a plurality of documents. Each of the multiple documents and the target document may include a plurality of words, and each of words corresponds to a different integer. The method comprises: for each document among the plurality of documents and the target document, obtaining a set of integers associated with a document based on a set of words associated with the document, converting the set of integers associated with the document into a vector with a same dimension based on a predefined conversion rule; and determining the similar document set based on differences between the corresponding vectors for the multiple documents and the vector for the target document.
    Type: Application
    Filed: December 20, 2016
    Publication date: June 29, 2017
    Inventors: Sean Kun Zhao, Chao Chen, Winston Lei Zhang, Jingjing Liu, Kun Wu Huang