Patents by Inventor MINWEI FENG

MINWEI FENG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PARAMETER DATA SHARING FOR MULTI-LEARNER TRAINING OF MACHINE LEARNING APPLICATIONS

Publication number: 20240020582

Abstract: A machine receives a first set of global parameters from a global parameter server. Multiple learner processors in the machine execute an algorithm that models an entity type using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.

Type: Application

Filed: July 19, 2023

Publication date: January 18, 2024

Inventors: Minwei Feng, YUFEI REN, Yandong Wang, Li Zhang, Wei Zhang
Parameter data sharing for multi-learner training of machine learning applications

Patent number: 11748666

Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.

Type: Grant

Filed: November 10, 2016

Date of Patent: September 5, 2023

Assignee: International Business Machines Corporation

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
Storage controller acceleration for neural network training and inference

Patent number: 11138494

Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.

Type: Grant

Filed: May 2, 2017

Date of Patent: October 5, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
Method for visualizing neural network models

Patent number: 10936938

Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.

Type: Grant

Filed: December 28, 2017

Date of Patent: March 2, 2021

Assignee: International Business Machines Corporation

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
Hybrid aggregation for deep learning neural networks

Patent number: 10783437

Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.

Type: Grant

Filed: March 5, 2017

Date of Patent: September 22, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
Forecasting solar power output

Patent number: 10732319

Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.

Type: Grant

Filed: August 30, 2017

Date of Patent: August 4, 2020

Assignee: International Business Machines Corporation

Inventors: Minwei Feng, Ildar Khabibrakhmanov, Tarun Kumar, Mark A. Lavin, Kevin W. Warren, Rui Zhang, Wei Zhang
Local multicast in single-host multi-GPU machine for distributed deep learning systems

Patent number: 10614356

Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.

Type: Grant

Filed: April 24, 2017

Date of Patent: April 7, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
METHOD FOR VISUALIZING NEURAL NETWORK MODELS

Publication number: 20190205728

Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.

Type: Application

Filed: December 28, 2017

Publication date: July 4, 2019

Inventors: Minwei Feng, Yufei Ren, Yaodong Wang, Li Zhang, Wei Zhang
FORECASTING SOLAR POWER OUTPUT

Publication number: 20190064392

Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.

Type: Application

Filed: August 30, 2017

Publication date: February 28, 2019

Inventors: MINWEI FENG, ILDAR KHABIBRAKHMANOV, TARUN KUMAR, MARK A. LAVIN, KEVIN W. WARREN, RUI ZHANG, WEI ZHANG
STORAGE CONTROLLER ACCELARATION FOR NEURAL NETWORK TRAINING AND INFERENCE

Publication number: 20180322383

Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.

Type: Application

Filed: May 2, 2017

Publication date: November 8, 2018

Applicant: International Business Machines Corporation

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
LOCAL MULTICAST IN SINGLE-HOST MULTI-GPU MACHINE FOR DISTRIBUTED DEEP LEARNING SYSTEMS

Publication number: 20180307972

Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.

Type: Application

Filed: April 24, 2017

Publication date: October 25, 2018

Applicant: International Business Machines Corporation

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
HYBRID AGGREGATION FOR DEEP LEARNING NEURAL NETWORKS

Publication number: 20180253646

Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.

Type: Application

Filed: March 5, 2017

Publication date: September 6, 2018

Applicant: International Business Machines Corporation

Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
SOLAR POWER FORECASTING WITH VOLUMETRIC CONVOLUTIONAL NEURAL NETWORK

Publication number: 20180218254

Abstract: Four-dimensional (4D) weather forecast data is received which includes a plurality of weather features. The 4D weather forecast data is processed using a chain of a plurality of processing blocks of a neural network to derive one or more of the plurality of weather features. Each of the plurality of processing blocks includes a convolutional layer, an activation layer, and a pooling layer. The convolution layer associates at least one filter to a region of the 4D weather forecast data across a plurality of layers in the 4D weather forecast data. A solar power forecast is determined for a predetermined location based upon the one or more derived weather features.

Type: Application

Filed: February 2, 2017

Publication date: August 2, 2018

Applicant: International Business Machines Corporation

Inventors: Minwei Feng, Tarun Kumar, Rui Zhang, Wei Zhang
PARAMETER DATA SHARING FOR MULTI-LEARNER TRAINING OF MACHINE LEARNING APPLICATIONS

Publication number: 20180129969

Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.

Type: Application

Filed: November 10, 2016

Publication date: May 10, 2018

Inventors: MINWEI FENG, YUFEI REN, YANDONG WANG, LI ZHANG, WEI ZHANG