Patents by Inventor Zhuo Cai

Zhuo Cai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatically generating defect data of printed matter for flaw detection

Patent number: 11967055

Abstract: Technology for inspection for detecting a defect of a printed matter using machine logic informed by machine learning. Some embodiments of the present invention may include one, or more, of the following features: (i) generates defect datasets; (ii) generates defect libraries; (iii) uses the generated defect libraries for deep learning training; and (iv) uses machine learning to detect defects using computer code (for example, a *.jpg format file) corresponding to an image of a piece of printed matter instead of using a visual image (that is, an image of the type that is created when a person takes a picture using a traditional film camera).

Type: Grant

Filed: June 30, 2021

Date of Patent: April 23, 2024

Assignee: International Business Machines Corporation

Inventors: Zhuo Cai, Chao Xin, Dan Zhang, Hong Bing Zhang, De Bo Xiong
Automatic appending of subtitles based on media context

Patent number: 11936958

Abstract: A processor may automatically generate one or more transcripts based on a media context. The processor may append at least one of the one or more transcripts to the media. The processor may modify the at least one of the one or more transcripts based on an adjustment to a weight factor.

Type: Grant

Filed: July 28, 2021

Date of Patent: March 19, 2024

Assignee: International Business Machines Corporation

Inventors: Jian Dong Yin, Wen Wang, Zhuo Cai, Rong Fu, Hao Sheng, Kang Zhang
Cognitive enabled blockchain based resource prediction

Patent number: 11836656

Abstract: A method, computer system, and a computer program product for blockchain based resource predictions and management is provided. Embodiments of the present invention may include receiving a request for a prediction of a future resource requirement. Embodiments of the present invention may include loading data structures. Embodiments of the present invention may include classifying collected data. Embodiments of the present invention may include predicting the future resource requirement. Embodiments of the present invention may include adjusting the priority of the future resource requirement. Embodiments of the present invention may include providing notifications.

Type: Grant

Filed: September 6, 2019

Date of Patent: December 5, 2023

Assignee: International Business Machines Corporation

Inventors: Zhuo Cai, Bing Xin Wang, Kushal Patel, Sarvesh S. Patel
Subtitle generation using background information

Patent number: 11646030

Abstract: A video is received. One or more subtitles are determined for the video. Whether a word found in a background of the video is similar to a word found in the one or more subtitles is determined. Responsive to determining the word found in the background of the video is similar to the word found in the one or more subtitles, one or more updated subtitles are generated. The one or more updated subtitles include the word found in the background of the video and remove the word found in the one or more subtitles that is similar. A metric for the one or more updated subtitles is calculated. Whether the metric is larger than a threshold is determined. Responsive to determining the metric is larger than the threshold, the video is updated to include the one or more updated subtitles.

Type: Grant

Filed: July 7, 2020

Date of Patent: May 9, 2023

Assignee: International Business Machines Corporation

Inventors: Zhuo Cai, Wen Wang, Jian Dong Yin, Rong Fu, Hao Sheng, Kang Zhang
Processing irregularly arranged characters

Patent number: 11574456

Abstract: Aspects of the present disclosure relate to processing irregularly arranged characters. An image is received. An irregularly arranged character within the image is detected. A direction of the irregularly arranged character is modified to a proper direction to obtain a properly oriented character. The properly oriented character is recognized to obtain a first identified character. The image is then rebuilt by replacing the irregularly arranged character with the first identified character, the first identified character in a machine-encoded format.

Type: Grant

Filed: October 7, 2019

Date of Patent: February 7, 2023

Assignee: International Business Machines Corporation

Inventors: Zhuo Cai, Jian Dong Yin, Wen Wang, Rong Fu, Hao Sheng, Kang Zhang
Logical channel management in a communication system

Patent number: 11576181

Abstract: Embodiments of the present disclosure relate to logical channel management in a communication network. In an embodiment, a mapping between a plurality of logical channels of at least one terminal device and a plurality of resource sets of a network device is determined. The resource sets are assigned for communication between the at least one terminal device and the network device via the logical channels. If at least one resource set is overloaded, at least one of the plurality of logical channels is determined based on the mapping. Status information indicating that the at least one logical channel is in a congestion status is caused to be transmitted to a target terminal device of the at least one terminal device, the target terminal device communicating with the network device via the at least one logical channel.

Type: Grant

Filed: August 10, 2020

Date of Patent: February 7, 2023

Assignee: International Business Machines Corporation

Inventors: Zhuo Cai, Kushal S. Patel, Sarvesh S. Patel, Bing Xin Wang
AUTOMATIC APPENDING OF SUBTITLES BASED ON MEDIA CONTEXT

Publication number: 20230030342

Abstract: A processor may automatically generate one or more transcripts based on a media context. The processor may append at least one of the one or more transcripts to the media. The processor may modify the at least one of the one or more transcripts based on an adjustment to a weight factor.

Type: Application

Filed: July 28, 2021

Publication date: February 2, 2023

Inventors: Jian Dong Yin, Wen Wang, Zhuo Cai, Rong Fu, Hao Sheng, Kang Zhang
AUTOMATICALLY GENERATING DEFECT DATA OF PRINTED MATTER FOR FLAW DETECTION

Publication number: 20230004814

Abstract: Technology for inspection for detecting a defect of a printed matter using machine logic informed by machine learning. Some embodiments of the present invention may include one, or more, of the following features: (i) generates defect datasets; (ii) generates defect libraries; (iii) uses the generated defect libraries for deep learning training; and (iv) uses machine learning to detect defects using computer code (for example, a *.jpg format file) corresponding to an image of a piece of printed matter instead of using a visual image (that is, an image of the type that is created when a person takes a picture using a traditional film camera).

Type: Application

Filed: June 30, 2021

Publication date: January 5, 2023

Inventors: Zhuo Cai, Chao Xin, Dan Zhang, Hong Bing Zhang, De Bo Xiong
Text block recognition based on discrete character recognition and text information connectivity

Patent number: 11514699

Abstract: In an approach for a text block recognition in a document, a processor detects characters in the document using an object detection technique. A processor identifies positions of the detected characters in the document. A processor analyzes semantic connectivity among the detected characters based on the positions and semantic connectivity of the characters. A processor recognizes text blocks of related characters based on the semantic connectivity analysis. A processor outputs the text blocks associated with the related characters.

Type: Grant

Filed: July 30, 2020

Date of Patent: November 29, 2022

Assignee: International Business Machines Corporation

Inventors: Zhong Fang Yuan, Zhuo Cai, Tong Liu, Yu Pan, Li Ni Zhang, Jian Long Li
Computer automated interactive activity recognition based on keypoint detection

Patent number: 11514605

Abstract: Computer automated interactive activity recognition based on keypoint detection includes retrieving, by one or more processors, a temporal sequence of image frames from a video recording. The one or more processors identify first and second keypoints in each of the image frames in the temporal sequence using machine learning techniques. The first keypoints are associated with an object in the temporal sequence of image frames while the second keypoints are associated with an individual interacting with the object. The one or more processors combine the first keypoints with the second keypoints and extract spatial-temporal features from the combination that are used to train a classification model based on which interactive activities can be recognized.

Type: Grant

Filed: September 29, 2020

Date of Patent: November 29, 2022

Assignee: International Business Machines Corporation

Inventors: Dan Zhang, Hong Bing Zhang, Chao Xin, Xue Ping Liu, Zhi Xing Peng, Zhuo Cai
Knowledge graph-based query in artificial intelligence chatbot with base query element detection and graph path generation

Patent number: 11328181

Abstract: Generating a query result utilizing a knowledge graph in an artificial intelligence chatbot is provided. Characteristics of a query are identified. The characteristics of the query are mapped to base elements of the knowledge graph in the artificial intelligence chatbot. A set of query paths are generated in the knowledge graph based on the mapping of the characteristics of the query to the base elements of the knowledge graph. One or more query paths in the set of query paths in the knowledge graph are validated based on a respective score of each query path. A query result corresponding to the query is generated based on the validated one or more query paths in the knowledge graph.

Type: Grant

Filed: August 26, 2019

Date of Patent: May 10, 2022

Assignee: International Business Machines Corporation

Inventors: Wen Wang, Jian Dong Yin, Zhuo Cai, Rong Fu, Hao Sheng, Kang Zhang
Determining image defects using image comparisons

Patent number: 11321822

Abstract: A method, computer system, and a computer program product for analyzing visual defects is provided. The present invention may include generating a template image. The present invention may include capturing a test image. The present invention may include performing an image registration between the template image and the test image. The present invention may include generating a registered test image. The present invention may include performing an image difference analysis between the registered test image and the template image. The present invention may include generating a differential image. The present invention may include synthesizing the registered, differential image, and template image. The present invention may include generating a synthetic image. The present invention may include inputting the synthetic image into a multi-scale detection network. The present invention may include generating a defect map.

Type: Grant

Filed: July 30, 2020

Date of Patent: May 3, 2022

Assignee: International Business Machines Corporation

Inventors: Chao Xin, Zhuo Cai, Hong Bing Zhang, Dan Zhang, Guang Qing Zhong
Dynamic workflow with knowledge graphs

Patent number: 11314812

Abstract: Disclosed embodiments provide techniques for computerized technical support. A knowledge graph for a computer application is established. An input query from a user is processed to extract entities used as action identifiers. One or more nodes within the knowledge graph are identified, along with corresponding relationship edges leading to the nodes. When multiple candidate nodes are found that contain information relevant to the input query, a custom clarification statement is created based on the one or more identified relationship edges. The user provides answers to the clarification statement to narrow down which nodes contain the most relevant information. This process may continue, eliminating nodes based on user responses, until a single node remains, corresponding to an action identifier. The action identifier includes action description information that provides technical assistance to a user.

Type: Grant

Filed: December 4, 2019

Date of Patent: April 26, 2022

Assignee: International Business Machines Corporation

Inventors: Wen Wang, Guang Qing Zhong, Yi Ming Wang, Jian Dong Yin, Zhuo Cai, Rong Fu, Kang Zhang, Hao Sheng
COMPUTER AUTOMATED INTERACTIVE ACTIVITY RECOGNITION BASED ON KEYPOINT DETECTION

Publication number: 20220101556

Abstract: Computer automated interactive activity recognition based on keypoint detection includes retrieving, by one or more processors, a temporal sequence of image frames from a video recording. The one or more processors identify first and second keypoints in each of the image frames in the temporal sequence using machine learning techniques. The first keypoints are associated with an object in the temporal sequence of image frames while the second keypoints are associated with an individual interacting with the object. The one or more processors combine the first keypoints with the second keypoints and extract spatial-temporal features from the combination that are used to train a classification model based on which interactive activities can be recognized.

Type: Application

Filed: September 29, 2020

Publication date: March 31, 2022

Inventors: Dan Zhang, Hong Bing Zhang, Chao Xin, Xue Ping Liu, Zhi Xing Peng, Zhuo Cai
LOGICAL CHANNEL MANAGEMENT IN A COMMUNICATION SYSTEM

Publication number: 20220046647

Abstract: Embodiments of the present disclosure relate to logical channel management in a communication network. In an embodiment, a mapping between a plurality of logical channels of at least one terminal device and a plurality of resource sets of a network device is determined. The resource sets are assigned for communication between the at least one terminal device and the network device via the logical channels. If at least one resource set is overloaded, at least one of the plurality of logical channels is determined based on the mapping. Status information indicating that the at least one logical channel is in a congestion status is caused to be transmitted to a target terminal device of the at least one terminal device, the target terminal device communicating with the network device via the at least one logical channel.

Type: Application

Filed: August 10, 2020

Publication date: February 10, 2022

Inventors: Zhuo Cai, Kushal S. Patel, Sarvesh S. Patel, Bing Xin WANG
TEXT BLOCK RECOGNITION BASED ON DISCRETE CHARACTER RECOGNITION AND TEXT INFORMATION CONNECTIVITY

Publication number: 20220036062

Abstract: In an approach for a text block recognition in a document, a processor detects characters in the document using an object detection technique. A processor identifies positions of the detected characters in the document. A processor analyzes semantic connectivity among the detected characters based on the positions and semantic connectivity of the characters. A processor recognizes text blocks of related characters based on the semantic connectivity analysis. A processor outputs the text blocks associated with the related characters.

Type: Application

Filed: July 30, 2020

Publication date: February 3, 2022

Inventors: Zhong Fang Yuan, Zhuo Cai, Tong Liu, Yu Pan, Li Ni Zhang, Jian Long Li
DETERMINING IMAGE DEFECTS USING IMAGE COMPARISONS

Publication number: 20220036525

Abstract: A method, computer system, and a computer program product for analyzing visual defects is provided. The present invention may include generating a template image. The present invention may include capturing a test image. The present invention may include performing an image registration between the template image and the test image. The present invention may include generating a registered test image. The present invention may include performing an image difference analysis between the registered test image and the template image. The present invention may include generating a differential image. The present invention may include synthesizing the registered, differential image, and template image. The present invention may include generating a synthetic image. The present invention may include inputting the synthetic image into a multi-scale detection network. The present invention may include generating a defect map.

Type: Application

Filed: July 30, 2020

Publication date: February 3, 2022

Inventors: Chao Xin, Zhuo Cai, Hong Bing Zhang, Dan Zhang, Guang Qing Zhong
SUBTITLE GENERATION USING BACKGROUND INFORMATION

Publication number: 20220013125

Abstract: A video is received. One or more subtitles are determined for the video. Whether a word found in a background of the video is similar to a word found in the one or more subtitles is determined. Responsive to determining the word found in the background of the video is similar to the word found in the one or more subtitles, one or more updated subtitles are generated. The one or more updated subtitles include the word found in the background of the video and remove the word found in the one or more subtitles that is similar. A metric for the one or more updated subtitles is calculated. Whether the metric is larger than a threshold is determined. Responsive to determining the metric is larger than the threshold, the video is updated to include the one or more updated subtitles.

Type: Application

Filed: July 7, 2020

Publication date: January 13, 2022

Inventors: ZHUO CAI, WEN WANG, JIAN DONG YIN, RONG FU, HAO SHENG, KANG ZHANG
EXTRACTING CONTENT FROM AS DOCUMENT USING VISUAL INFORMATION

Publication number: 20220012421

Abstract: An aspect of the present invention discloses a method for extracting content from a document. The method includes one or more processors identifying a visual anchor corresponding to a text element depicted in a first document utilizing an edge detection analysis. The method further includes determining edge coordinates of the text element depicted in the first document. The method further includes determining text at a leading edge of the text element depicted in the first document and text at a trailing edge of the text element depicted in the first document, based on the determined edge coordinates. The method further includes extracting a complete version of the text element depicted in the first document, from a plain text version of the first document, utilizing the determined text at the leading edge of the text element and the determined text at the trailing edge of the text element.

Type: Application

Filed: July 13, 2020

Publication date: January 13, 2022

Inventors: Zhong Fang Yuan, Zhuo Cai, Tong Liu, Yu Pan, Xiang Yu Yang, Dong Qin
Real-time detection and visualization of potential impairments in under-floor appliances

Patent number: 11175652

Abstract: A method, computer system, and a computer program product for predictive maintenance is provided. The present invention may include recording, using an autonomous robot moving along a surface through a plurality of positions in a room, a plurality of data associated with an under-floor appliance provided beneath the surface of the room. The present invention may also include calculating, based on the recorded plurality of data associated with the under-floor appliance provided beneath the surface of the room, a material composition associated with the plurality of positions in the room. The present invention may further include generating, based on the calculated material composition associated with the plurality of positions in the room, a layout diagram for visualizing a layout of the under-floor appliance provided beneath the surface of the room.

Type: Grant

Filed: February 20, 2019

Date of Patent: November 16, 2021

Assignee: International Business Machines Corporation

Inventors: Hao Sheng, Rong Fu, Kang Zhang, Jian Dong Yin, Zhuo Cai, Wen Wang

1 2 next