Patents by Inventor MINWEI FENG

MINWEI FENG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240020582
    Abstract: A machine receives a first set of global parameters from a global parameter server. Multiple learner processors in the machine execute an algorithm that models an entity type using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.
    Type: Application
    Filed: July 19, 2023
    Publication date: January 18, 2024
    Inventors: Minwei Feng, YUFEI REN, Yandong Wang, Li Zhang, Wei Zhang
  • Patent number: 11748666
    Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.
    Type: Grant
    Filed: November 10, 2016
    Date of Patent: September 5, 2023
    Assignee: International Business Machines Corporation
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Patent number: 11138494
    Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.
    Type: Grant
    Filed: May 2, 2017
    Date of Patent: October 5, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Patent number: 10936938
    Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.
    Type: Grant
    Filed: December 28, 2017
    Date of Patent: March 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Patent number: 10783437
    Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.
    Type: Grant
    Filed: March 5, 2017
    Date of Patent: September 22, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Patent number: 10732319
    Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.
    Type: Grant
    Filed: August 30, 2017
    Date of Patent: August 4, 2020
    Assignee: International Business Machines Corporation
    Inventors: Minwei Feng, Ildar Khabibrakhmanov, Tarun Kumar, Mark A. Lavin, Kevin W. Warren, Rui Zhang, Wei Zhang
  • Patent number: 10614356
    Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.
    Type: Grant
    Filed: April 24, 2017
    Date of Patent: April 7, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Publication number: 20190205728
    Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.
    Type: Application
    Filed: December 28, 2017
    Publication date: July 4, 2019
    Inventors: Minwei Feng, Yufei Ren, Yaodong Wang, Li Zhang, Wei Zhang
  • Publication number: 20190064392
    Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.
    Type: Application
    Filed: August 30, 2017
    Publication date: February 28, 2019
    Inventors: MINWEI FENG, ILDAR KHABIBRAKHMANOV, TARUN KUMAR, MARK A. LAVIN, KEVIN W. WARREN, RUI ZHANG, WEI ZHANG
  • Publication number: 20180322383
    Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.
    Type: Application
    Filed: May 2, 2017
    Publication date: November 8, 2018
    Applicant: International Business Machines Corporation
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Publication number: 20180307972
    Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.
    Type: Application
    Filed: April 24, 2017
    Publication date: October 25, 2018
    Applicant: International Business Machines Corporation
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Publication number: 20180253646
    Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.
    Type: Application
    Filed: March 5, 2017
    Publication date: September 6, 2018
    Applicant: International Business Machines Corporation
    Inventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
  • Publication number: 20180218254
    Abstract: Four-dimensional (4D) weather forecast data is received which includes a plurality of weather features. The 4D weather forecast data is processed using a chain of a plurality of processing blocks of a neural network to derive one or more of the plurality of weather features. Each of the plurality of processing blocks includes a convolutional layer, an activation layer, and a pooling layer. The convolution layer associates at least one filter to a region of the 4D weather forecast data across a plurality of layers in the 4D weather forecast data. A solar power forecast is determined for a predetermined location based upon the one or more derived weather features.
    Type: Application
    Filed: February 2, 2017
    Publication date: August 2, 2018
    Applicant: International Business Machines Corporation
    Inventors: Minwei Feng, Tarun Kumar, Rui Zhang, Wei Zhang
  • Publication number: 20180129969
    Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.
    Type: Application
    Filed: November 10, 2016
    Publication date: May 10, 2018
    Inventors: MINWEI FENG, YUFEI REN, YANDONG WANG, LI ZHANG, WEI ZHANG