Patents by Inventor Zhong Fang Yuan

Zhong Fang Yuan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11640388
    Abstract: Methods, computer program products, and/or systems are provided that perform the following operations: obtaining pre-check data associated with specified data nodes; calculating outliers for each specified data node, wherein the outliers are calculated based on a unit of the pre-check data associated with each specified data node; backtracking the calculated outliers for each specified data node through an associated generating data link; selecting one or more data nodes associated with a set of largest outliers; selecting one or more data links associated with the set of largest outliers; and generating potential anomaly indications based on the one or more data nodes selected and the one or more data links selected.
    Type: Grant
    Filed: April 30, 2021
    Date of Patent: May 2, 2023
    Assignee: International Business Machines Corporation
    Inventors: Xiang Yu Yang, Deng Xin Luo, Ye Wang, Yu Pan, Zhong Fang Yuan, Miao Guo
  • Publication number: 20230127907
    Abstract: Embodiments of the present disclosure relate to question answering. A computer-implemented method includes determining a plurality of intention candidates of a user from the user's question; determining a set of entities and attributes associated with the set of entities from the plurality of intention candidates; constructing a decision tree from the set of entities and the attributes associated with the set of entities, wherein each node of the decision tree is associated with a respective one of the attributes and represents a respective subset of the plurality of intention candidates, and wherein the respective subset of the plurality of intention candidates are split based on the entities associated with the respective one of the attributes; and generating a question corresponding to a node of the decision tree to determine the user's intention.
    Type: Application
    Filed: October 22, 2021
    Publication date: April 27, 2023
    Inventors: Zhong Fang Yuan, Tong Liu, Li Juan Gao, Yi Chen Zhong, Hai Bo Zou
  • Publication number: 20230095180
    Abstract: An approach is provided for optimizing a feedback-type question answering process. A training set is constructed to detect missing information of a question. A natural language generation model is trained using the missing information. The natural language generation model is executed to generate a rhetorical question. A response to the rhetorical question is combined with the question to generate an input to a language processor. A new question is generated. The new question is applied to a document library. A final answer is generated.
    Type: Application
    Filed: September 29, 2021
    Publication date: March 30, 2023
    Inventors: Zhong Fang Yuan, Tong Liu, Chen Gao, Xiang Yu Yang
  • Publication number: 20230103033
    Abstract: Methods, apparatus, and computer program products for two-phased medical diagnosis are provided. The computer-implemented method comprises receiving, by one or more processors, data during a process of a medical diagnosis from a source of data information. The computer-implemented method also comprises extracting, by one or more processors, features from the received data. The computer-implemented method also comprises transferring, by one or more processors, the extracted features in the form of feature vectors to a server via a network. The computer-implemented method further comprises obtaining, by one or more processors, a recommendation of medical diagnosis from the server, wherein the recommendation of medical diagnosis is based, at least in part, on labels determined for the feature vectors.
    Type: Application
    Filed: September 24, 2021
    Publication date: March 30, 2023
    Inventors: Zhong Fang Yuan, Xiang Yu Yang, Tong Liu, Han Ying Song, Ting LM Li
  • Publication number: 20230090993
    Abstract: A method, computer program product and computer system to provide topic guide during document drafting is provided. A processor retrieves at least one section of text from a document. A processor receives a target topic for the document. A processor extracts at least one local topic from the at least one section of text. A processor generates a semantic network comprising the at least one local topic and the target topic. A processor determines a deviation value for the at least one local topic based on a distance between the at least one local topic and the target topic in the semantic network. A processor, in response to the deviation value exceeding a threshold value, alerts a user that the at least one section of text from the document is off-topic from the target topic.
    Type: Application
    Filed: September 23, 2021
    Publication date: March 23, 2023
    Inventors: Xiang Yu Yang, Wen Jie Hao, Zhong Fang Yuan, Wang Hu Dang, Deng Xin Luo, Jia Yong Xie, Wen Wang
  • Publication number: 20230083195
    Abstract: A method, computer program product, and computer system for repairing a Dockerfile. Library versions containing initial version numbers of libraries are extracted from the Dockerfile. A Monte Carlo tree search (MCTS) is executed, using the extracted library versions as input, which generates a tree that includes multiple levels populated with nodes. Each node in a level represents the generic library name of a library version in the Dockerfile and an associated randomly selected version number. At least one of the randomly selected version numbers associated with at least one node in a level differs from the initial version number associated with a version. A best successful installation path is selected from the at least one successful installation path. The Dockerfile is repaired by inserting randomly selected version numbers into the Dockerfile as replacements for some of the initial version numbers.
    Type: Application
    Filed: September 16, 2021
    Publication date: March 16, 2023
    Inventors: Xiang Yu Yang, Yong Wang, Zhong Fang Yuan, Deng Xin Luo, Ye Wang, Zhi Yong Jia
  • Publication number: 20230073932
    Abstract: A computer-implemented method, according to one embodiment, includes: receiving an image having characters that correspond to a language, and using a text recognition algorithm to determine a first language believed to correspond to the characters. A first confidence level associated with the first language is also computed, and a determination is made as to whether the first confidence level associated with the first language is outside a predetermined range. In response to determining that the first confidence level associated with the first language is not outside the predetermined range, the first language is output as the given language. The text recognition algorithm is trained using a simple shallow neural network and a generated mixed language corpus. The generated mixed language corpus is formed by: randomly sampling libraries having vocabulary and/or characters therein, and combining the randomly sampled vocabulary and/or characters to form the generated mixed language corpus.
    Type: Application
    Filed: September 7, 2021
    Publication date: March 9, 2023
    Inventors: Zhong Fang Yuan, Tong Liu, Li Juan Gao, Xiang Yu Yang, Qiang He, Yu Pan
  • Publication number: 20230072003
    Abstract: A system, method, and computer program product for implementing cognitive natural language processing software framework optimization is provided. The method includes receiving instructions associated with an audible user input of a user. An AI input intention of the user is determined and key information is extracted from the audible user input. The key information is inputted into a generated database table and additional key information is retrieved from a dialog table. A supplementary database table comprising the additional key information is generated and the key information is spliced with the additional key information. A resulting spliced data structure is merged into a final database table and natural language is converted into a request code structure within an SQL structure and an interactive AI interface presenting results of the converting is generated. Operational functionality of an AI device is enabled for audibly presenting results of the conversion.
    Type: Application
    Filed: September 7, 2021
    Publication date: March 9, 2023
    Inventors: Zhong Fang Yuan, Tong Liu, De Shuo Kong, Yao Chen, Hai Bo Zou, Sarbajit K. Rakshit, Zheng Jie
  • Publication number: 20230076923
    Abstract: In order to perform a semantic search based on a graph database, sets of nodes are selected from a plurality of nodes in a graph database. A set of nodes semantically matches a keyword in a natural language query. At least one target node is identified in the sets of nodes. A path is selected from candidate paths based on similarities between the candidate paths and a plurality of paths in the graph database. A graph query for retrieving information from the graph database is generated based on the selected path and the query target.
    Type: Application
    Filed: September 7, 2021
    Publication date: March 9, 2023
    Inventors: Teng Sun, Tong Liu, Si Tong Zhao, XueLiang Zhao, Frank Feng, Yu Zui WY You, Zhong Fang Yuan
  • Patent number: 11557284
    Abstract: A method, system and computer program product for speech recognition using multiple languages includes receiving, by one or more processors, an input from a user, the input includes a sentence in a first language. The one or more processors translate the sentence to a plurality of languages different than the first language, and create vectors associated with the plurality of languages, each vector includes a representation of the sentence in each of the plurality of languages. The one or more processors calculate eigenvectors for each vector associated with a language in the plurality of languages, and based on the calculated eigenvectors, a score is assigned to each of the plurality of languages according to a relevance for determining a meaning of the sentence.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: January 17, 2023
    Assignee: International Business Machines Corporation
    Inventors: Zhong Fang Yuan, Kun Yan Yin, He Li, Tong Liu, Hai Ji
  • Publication number: 20230004750
    Abstract: The embodiments of the present disclosure disclose a computer-implemented method, computer system and a computer program product for detecting and predicting an abnormal log event. In the method, a current event cluster from a plurality of event clusters for a log line in a log file is determined. The plurality of event clusters include at least one abnormal event cluster. Then, a time of event transition from the current event cluster to at least one abnormal event cluster is predicted.
    Type: Application
    Filed: June 30, 2021
    Publication date: January 5, 2023
    Inventors: Yi Ming Wang, Hui Dong, Zhong Fang Yuan, Tong Liu, Yan Fen Liu, Ling Chen
  • Publication number: 20220405524
    Abstract: A method, computer system, and a computer program product for optical character recognition training are provided. A text image and plain text labels for the text image may be received. The text image may include words. The plain text labels may include machine-encoded text corresponding to the words. Semantic feature vectors for the words, respectively, may be generated based on the plain text label. The text image, the plain text labels, and the semantic feature vectors may be input together into a machine learning model to train the machine learning model for optical character recognition. The plain text labels and the semantic feature vectors may be constraints for the training.
    Type: Application
    Filed: June 17, 2021
    Publication date: December 22, 2022
    Inventors: Zhong Fang Yuan, Tong Liu, Jing Wen Xu, Xiang Yu Yang, Yu Pan, Wei NB Wu
  • Publication number: 20220404193
    Abstract: Reducing an average give-away rate of a weighing device by determining a weight of a product of a weighing device that includes an article, determining one or more conditions of an environment of the weighing device, determining a state of the environment of the weighing device, wherein the state relates to an average give-away rate of the environment of the weighing device, determining a reward value for the state of the environment of the weighing device, wherein the reward value is based at least in part on the weight of the product, and generating a set of parameters for the weighing device based at least in part on the environment, the state, and the reward.
    Type: Application
    Filed: June 17, 2021
    Publication date: December 22, 2022
    Inventors: Deng Xin Luo, Xiang Yu Yang, Yong Wang, Ye Wang, Zhong Fang Yuan, Yu Pan
  • Publication number: 20220391183
    Abstract: Techniques are provided for mapping natural language to code segments. In one embodiment, the techniques involve receiving a document and software code, wherein the document comprises a natural language description of a use of the code, generating, via a vectorization process performed on the document, at least one vector or word embedding, generating, via a natural language processing technique performed on the at least one vector or word embedding, a first label set, generating, via a machine learning analysis of the software code, a second label set, determining, based on a comparison of the first label set and the second label set, a match confidence between the document and the software code, wherein the match confidence indicates a measure of similarity between the first label set and the second label set, and upon determining that the match confidence exceeds a predefined threshold, mapping the document to the software code.
    Type: Application
    Filed: June 3, 2021
    Publication date: December 8, 2022
    Inventors: Zhong Fang Yuan, Bin Shang, Li Ni Zhang, Yong Fang Liang, Chen Gao, Tong Liu
  • Patent number: 11521602
    Abstract: A set of candidate intent vectors is generated from an input intent vector. A validation of the set of candidate intent vectors is performed that selects as valid intent vectors any of the set of candidate intent vectors that are semantically similar to the input intent vector.
    Type: Grant
    Filed: May 10, 2021
    Date of Patent: December 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Zhong Fang Yuan, Kun Yan Yin, Yuan Lin Yang, Tong Liu, He Li
  • Patent number: 11514699
    Abstract: In an approach for a text block recognition in a document, a processor detects characters in the document using an object detection technique. A processor identifies positions of the detected characters in the document. A processor analyzes semantic connectivity among the detected characters based on the positions and semantic connectivity of the characters. A processor recognizes text blocks of related characters based on the semantic connectivity analysis. A processor outputs the text blocks associated with the related characters.
    Type: Grant
    Filed: July 30, 2020
    Date of Patent: November 29, 2022
    Assignee: International Business Machines Corporation
    Inventors: Zhong Fang Yuan, Zhuo Cai, Tong Liu, Yu Pan, Li Ni Zhang, Jian Long Li
  • Patent number: 11501550
    Abstract: A method, system, and computer program product for segmenting and processing documents for optical character recognition is provided. The method includes receiving a document and detecting different types of text data. The document is divided into a plurality of text regions associated with the different types of said text data. Optical noise is removed from each text region and differing optical character recognition software code is selected for application to each text region. The differing optical character recognition software code is executed with respect to each text region resulting in extractable computer readable text located within each said text region.
    Type: Grant
    Filed: November 24, 2020
    Date of Patent: November 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Zhong Fang Yuan, Yu Pan, Tong Liu, Yi Chen Zhong, Li Juan Gao, Qiong Wu, Dan Dan Wu
  • Publication number: 20220350789
    Abstract: Methods, computer program products, and/or systems are provided that perform the following operations: obtaining pre-check data associated with specified data nodes; calculating outliers for each specified data node, wherein the outliers are calculated based on a unit of the pre-check data associated with each specified data node; backtracking the calculated outliers for each specified data node through an associated generating data link; selecting one or more data nodes associated with a set of largest outliers; selecting one or more data links associated with the set of largest outliers; and generating potential anomaly indications based on the one or more data nodes selected and the one or more data links selected.
    Type: Application
    Filed: April 30, 2021
    Publication date: November 3, 2022
    Inventors: Xiang Yu Yang, Deng Xin Luo, Ye Wang, Yu Pan, Zhong Fang Yuan, Miao Guo
  • Patent number: 11455322
    Abstract: Described are techniques for determining statistical properties of time series data. The techniques include a method comprising graphing, from time series data, a time series data graph. The method further comprises iteratively segmenting the time series data graph into respective pluralities of subgraphs using respective segmentation schemes until a first plurality of subgraphs generated by a first segmentation scheme exhibits a similarity between respective subgraphs of the first plurality of subgraphs satisfying a similarity threshold. The first segmentation scheme can be selected from: an equidistant segmentation scheme, a local extrema segmentation scheme, and a windowed segmentation scheme. The method further comprises associating a classification to the time series data based on the first segmentation scheme. The classification can be indicative of one selected from: stationarity of the time series data, periodicity of the time series data, and trending of the time series data.
    Type: Grant
    Filed: May 12, 2020
    Date of Patent: September 27, 2022
    Assignee: International Business Machines Corporation
    Inventors: Xiang Yu Yang, Deng Xin Luo, Jing Du, Zhong Fang Yuan, Tong Liu, Li Jia Lu
  • Patent number: 11455812
    Abstract: An approach for extracting non-textual data from an electronic document is disclosed. The approach includes receiving a request to extract a file and converting the file into pixels. The approach creates a pixel map of the converted file and determines one or more density clusters of the pixel map based on image clustering method. Furthermore, the approach determines one or more coordinates of the one or more density clusters and determines one or more candidate information regions based on the one or more coordinates, density of the one or more density clusters. Finally, the approach extracts one or more textual data based on the one or more candidate information regions and outputs the extracted one or more textual data.
    Type: Grant
    Filed: March 13, 2020
    Date of Patent: September 27, 2022
    Assignee: International Business Machines Corporation
    Inventors: Zhong Fang Yuan, Guang Qing Zhong, Tong Liu, De Shuo Kong, Yi Ming Wang
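
To give a concrete picture of the density-cluster step summarized in the abstract of patent 11455812 above, here is a minimal Python sketch. It is an illustration under stated assumptions, not the patented method: the abstract does not name the image clustering method, so a 4-connected flood fill is assumed as a stand-in. The function name find_dense_regions, the parameters min_pixels and min_density, and the sample pixel map are all hypothetical.

```python
from collections import deque


def find_dense_regions(pixel_map, min_pixels=2, min_density=0.5):
    """Return bounding boxes (top, left, bottom, right) of dense pixel clusters.

    pixel_map is a list of rows of 0/1 values (1 = inked pixel).
    Assumption: clusters are 4-connected groups of inked pixels; the
    abstract leaves the actual clustering method unspecified.
    """
    rows, cols = len(pixel_map), len(pixel_map[0])
    seen = [[False] * cols for _ in range(rows)]
    regions = []

    for r in range(rows):
        for c in range(cols):
            if not pixel_map[r][c] or seen[r][c]:
                continue

            # Breadth-first flood fill collects one cluster of inked pixels.
            queue = deque([(r, c)])
            seen[r][c] = True
            cells = []
            while queue:
                y, x = queue.popleft()
                cells.append((y, x))
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and pixel_map[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))

            # Coordinates of the cluster: its bounding box.
            top = min(y for y, _ in cells)
            bottom = max(y for y, _ in cells)
            left = min(x for _, x in cells)
            right = max(x for _, x in cells)

            # Density: share of the bounding box covered by inked pixels.
            area = (bottom - top + 1) * (right - left + 1)
            density = len(cells) / area

            # Keep only clusters large and dense enough to be candidates.
            if len(cells) >= min_pixels and density >= min_density:
                regions.append((top, left, bottom, right))

    return regions


# Hypothetical 4x5 pixel map: one 2x2 block plus two isolated pixels.
pixel_map = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0],
]
candidate_regions = find_dense_regions(pixel_map)
```

With the default thresholds, only the 2x2 block survives as a candidate information region; the two isolated pixels are discarded as too small. A real pipeline would then hand each candidate region to an extraction step, which this sketch omits.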