Patents by Inventor Kun Wu Huang
Kun Wu Huang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11763102Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.Type: GrantFiled: February 26, 2021Date of Patent: September 19, 2023Assignee: EMC IP Holding Company, LLCInventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Patent number: 11429648Abstract: Embodiments of the present disclosure generally relate to a method and device for creating an index. For example, the embodiments of the present disclosure propose a method for creating an index, comprising: dividing a document into a plurality of regions; determining the number of times that a token appears in the plurality of regions, the token including at least one character in the document; assigning respective weights to the plurality of regions; and creating an inverted document linked list directed to the token based on the number of times that the token appears in the plurality of regions and respective weights of the plurality of regions. In addition, the embodiments of the present disclosure propose a corresponding device and computer program product for creating an index.Type: GrantFiled: April 28, 2020Date of Patent: August 30, 2022Assignee: EMC IP Holding Company LLCInventors: Winston Lei Zhang, Charlie Chen, Kun Wu Huang, Jingjing Liu, Duke Dai
-
Patent number: 11068536Abstract: Embodiments of the present disclosure relate to a method and apparatus for managing a document index. The method comprises determining an independently updatable field in a plurality of documents, the independently updatable field comprising at least one item. The method further comprises creating an index for an item in the independently updatable field, the index containing an identifier of a document comprising the item, the document being included in the plurality of documents. Furthermore, the method further comprises storing the identifier of the document in blocks such that the index is updatable without modifying the identifier of the document.Type: GrantFiled: June 22, 2017Date of Patent: July 20, 2021Assignee: EMC IP Holding Company LLCInventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Patent number: 11048763Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.Type: GrantFiled: December 31, 2019Date of Patent: June 29, 2021Assignee: EMC IP Holding Company LLCInventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
-
Publication number: 20210182506Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.Type: ApplicationFiled: February 26, 2021Publication date: June 17, 2021Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Patent number: 10936667Abstract: Embodiments of the present disclosure provide a solution for indicating a search result. A method of indicating a search result is disclosed, which includes, in response to receiving a query term, searching for an electronic document having metadata related to the query term. The method further includes, in response to the electronic document being searched, locating a metadata term matching with the query term from the metadata of the electronic document. The method further includes providing an indication highlighting the metadata term.Type: GrantFiled: September 21, 2017Date of Patent: March 2, 2021Assignee: EMC IP Holding Company LLCInventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
-
Patent number: 10936829Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.Type: GrantFiled: June 21, 2017Date of Patent: March 2, 2021Assignee: EMC IP Holding Company LLCInventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Publication number: 20200257710Abstract: Embodiments of the present disclosure generally relate to a method and device for creating an index. For example, the embodiments of the present disclosure propose a method for creating an index, comprising: dividing a document into a plurality of regions; determining the number of times that a token appears in the plurality of regions, the token including at least one character in the document; assigning respective weights to the plurality of regions; and creating an inverted document linked list directed to the token based on the number of times that the token appears in the plurality of regions and respective weights of the plurality of regions. In addition, the embodiments of the present disclosure propose a corresponding device and computer program product for creating an index.Type: ApplicationFiled: April 28, 2020Publication date: August 13, 2020Inventors: Winston Lei Zhang, Charlie Chen, Kun Wu Huang, Jingjing Liu, Duke Dai
-
Publication number: 20200133981Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.Type: ApplicationFiled: December 31, 2019Publication date: April 30, 2020Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
-
Patent number: 10546024Abstract: Techniques for searching a character string involve: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string.Type: GrantFiled: March 20, 2017Date of Patent: January 28, 2020Assignee: EMC IP Holding Company LLCInventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
-
Patent number: 10331717Abstract: Embodiments of the present disclosure provide a method and an apparatus for determining a similar document set to a target document from a plurality of documents. Each of the multiple documents and the target document may include a plurality of words, and each of words corresponds to a different integer. The method comprises: for each document among the plurality of documents and the target document, obtaining a set of integers associated with a document based on a set of words associated with the document, converting the set of integers associated with the document into a vector with a same dimension based on a predefined conversion rule; and determining the similar document set based on differences between the corresponding vectors for the multiple documents and the vector for the target document.Type: GrantFiled: December 20, 2016Date of Patent: June 25, 2019Assignee: EMC IP Holding Company LLCInventors: Sean Kun Zhao, Chao Chen, Winston Lei Zhang, Jingjing Liu, Kun Wu Huang
-
Publication number: 20180089329Abstract: Embodiments of the present disclosure provide a method and device for managing index. For example, there is provided a method, comprising: obtaining a first index term in a first index, the first index term corresponding to a first index content in the first index, the first index content indicating a position of the first index term in a document; generating a reading of the first index term; and adding the reading as a second index term into a second index, the reading corresponding to a second index content indicating the first index term. Corresponding device and computer program product are also provided.Type: ApplicationFiled: September 21, 2017Publication date: March 29, 2018Inventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
-
Publication number: 20180089335Abstract: Embodiments of the present disclosure provide a solution for indicating a search result. A method of indicating a search result is disclosed, which includes, in response to receiving a query term, searching for an electronic document having metadata related to the query term. The method further includes, in response to the electronic document being searched, locating a metadata term matching with the query term from the metadata of the electronic document. The method further includes providing an indication highlighting the metadata term.Type: ApplicationFiled: September 21, 2017Publication date: March 29, 2018Inventors: Kun Wu Huang, Charlie Chen, Winston Lei Zhang, Jingjing Liu, Duke Dai
-
Publication number: 20170371978Abstract: Embodiments of the present disclosure relate to a method and apparatus for managing a document index. The method comprises determining an independently updatable field in a plurality of documents, the independently updatable field comprising at least one item. The method further comprises creating an index for an item in the independently updatable field, the index containing an identifier of a document comprising the item, the document being included in the plurality of documents. Furthermore, the method further comprises storing the identifier of the document in blocks such that the index is updatable without modifying the identifier of the document.Type: ApplicationFiled: June 22, 2017Publication date: December 28, 2017Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Publication number: 20170364510Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.Type: ApplicationFiled: June 21, 2017Publication date: December 21, 2017Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
-
Publication number: 20170270114Abstract: Embodiments of the present disclosure provide a method and device for searching a character string. In one embodiment, a method of searching a character string is provided. The method comprises: determining a first set of documents including a first token in the character string, and a second set of documents including a second token in the character string; and generating a third set of documents based on the first and second sets of documents, in the third set of documents: i) a document being included in the first and second sets of documents, and ii) a distance between the first and second tokens in the document being equal to a distance between the first and second tokens in the character string. A corresponding device and a computer program product are also disclosed.Type: ApplicationFiled: March 20, 2017Publication date: September 21, 2017Inventors: Duke Hongtao Dai, Winston Lei Zhang, Chao Chen, Kun Wu Huang, Jingjing Liu
-
Publication number: 20170270184Abstract: Embodiments of the present disclosure provide a method and device for processing objects to be searched. The method comprises: receiviug a first input indicating a constraint associated with an object; receiving a second input indicating a category to which the object belong; and establishing, based on the first input and the second input, a classification condition associating the constraint with the category as a part of a classification policy which is used for classifying the object into a category to create a search index. In addition, embodiments of the present disclosure further disclose a method and device for creating a search index for an object to be searched.Type: ApplicationFiled: March 17, 2017Publication date: September 21, 2017Inventors: Kun Wu Huang, Chao Chen, Winston Lei Zhang, Jingjing Liu, Duke Hongtao Dai
-
Publication number: 20170185671Abstract: Embodiments of the present disclosure provide a method and an apparatus for determining a similar document set to a target document from a plurality of documents. Each of the multiple documents and the target document may include a plurality of words, and each of words corresponds to a different integer. The method comprises: for each document among the plurality of documents and the target document, obtaining a set of integers associated with a document based on a set of words associated with the document, converting the set of integers associated with the document into a vector with a same dimension based on a predefined conversion rule; and determining the similar document set based on differences between the corresponding vectors for the multiple documents and the vector for the target document.Type: ApplicationFiled: December 20, 2016Publication date: June 29, 2017Inventors: Sean Kun Zhao, Chao Chen, Winston Lei Zhang, Jingjing Liu, Kun Wu Huang