Patents by Inventor MINWEI FENG
MINWEI FENG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240020582Abstract: A machine receives a first set of global parameters from a global parameter server. Multiple learner processors in the machine execute an algorithm that models an entity type using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.Type: ApplicationFiled: July 19, 2023Publication date: January 18, 2024Inventors: Minwei Feng, YUFEI REN, Yandong Wang, Li Zhang, Wei Zhang
-
Patent number: 11748666Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.Type: GrantFiled: November 10, 2016Date of Patent: September 5, 2023Assignee: International Business Machines CorporationInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Patent number: 11138494Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.Type: GrantFiled: May 2, 2017Date of Patent: October 5, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Patent number: 10936938Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.Type: GrantFiled: December 28, 2017Date of Patent: March 2, 2021Assignee: International Business Machines CorporationInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Patent number: 10783437Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.Type: GrantFiled: March 5, 2017Date of Patent: September 22, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Patent number: 10732319Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.Type: GrantFiled: August 30, 2017Date of Patent: August 4, 2020Assignee: International Business Machines CorporationInventors: Minwei Feng, Ildar Khabibrakhmanov, Tarun Kumar, Mark A. Lavin, Kevin W. Warren, Rui Zhang, Wei Zhang
-
Patent number: 10614356Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.Type: GrantFiled: April 24, 2017Date of Patent: April 7, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Publication number: 20190205728Abstract: A method for providing a graphical visualization of a neural network to a user is provided. The method includes generating the graphical visualization of the neural network at least in part by: representing layers of the neural network as respective three-dimensional blocks, wherein at least a first dimension of a given block is proportional to a computational complexity of a layer of the neural network represented by the given block; and representing data flows between the layers of the neural network as respective three-dimensional structures connecting blocks representing the layers of the neural network, wherein a first dimension of a given structure is proportional to each of a first dimension and a second dimension of a data flow represented by the given structure. The method also includes displaying the graphical visualization of the neural network to the user.Type: ApplicationFiled: December 28, 2017Publication date: July 4, 2019Inventors: Minwei Feng, Yufei Ren, Yaodong Wang, Li Zhang, Wei Zhang
-
Publication number: 20190064392Abstract: A method, computer system, and computer program product. Weather forecast data is generated with respect to an area encompassing a location of a solar farm by a computer system. Solar power output by the solar farm is forecasted by the computer system based on the generated weather forecast data. Forecasted solar power output data is generated by the computer system based on the forecasted solar power output by the solar farm. A power grid operation, including one or both of a power grid balancing operation and a power grid optimization operation, is performed based on the forecasted solar power output data.Type: ApplicationFiled: August 30, 2017Publication date: February 28, 2019Inventors: MINWEI FENG, ILDAR KHABIBRAKHMANOV, TARUN KUMAR, MARK A. LAVIN, KEVIN W. WARREN, RUI ZHANG, WEI ZHANG
-
Publication number: 20180322383Abstract: A storage controller of a machine receives training data associated with a neural network model. The neural network model includes a plurality of layers, and the machine further including at least one graphics processing unit. The storage controller trains at least one layer of the plurality of layers of the neural network model using the training data to generate processed training data. A size of the processed data is less than a size of the training data. Training of the at least one layer includes adjusting one or more weights of the at least one layer using the training data. The storage controller sends the processed training data to at least one graphics processing unit of the machine. The at least one graphics processing unit is configured to store the processed training data and train one or more remaining layers of the plurality of layers using the processed training data.Type: ApplicationFiled: May 2, 2017Publication date: November 8, 2018Applicant: International Business Machines CorporationInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Publication number: 20180307972Abstract: A network interface controller of a machine receives a packet including at least one model parameter of a neural network model from a server. The packet includes a virtual address associated with the network interface controller, and the machine further includes a plurality of graphics processing units coupled to the network interface controller by a bus. The network interface controller translates the virtual address to a memory address associated with each of the plurality of graphics processing units. The network interface controller broadcasts the at least one model parameter to the memory address associated with each of the plurality of graphics processing units.Type: ApplicationFiled: April 24, 2017Publication date: October 25, 2018Applicant: International Business Machines CorporationInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Publication number: 20180253646Abstract: A processing unit topology of a neural network including a plurality of processing units is determined. The neural network includes at least one machine in which each machine includes a plurality of nodes, and wherein each node includes at least one of the plurality of processing units. One or more of the processing units are grouped into a first group according to a first affinity. The first group is configured, using a processor and a memory, to use a first aggregation procedure for exchanging model parameters of a model of the neural network between the processing units of the first group. One or more of the processing units are grouped into a second group according to a second affinity. The second group is configured to use a second aggregation procedure for exchanging the model parameters between the processing units of the second group.Type: ApplicationFiled: March 5, 2017Publication date: September 6, 2018Applicant: International Business Machines CorporationInventors: Minwei Feng, Yufei Ren, Yandong Wang, Li Zhang, Wei Zhang
-
Publication number: 20180218254Abstract: Four-dimensional (4D) weather forecast data is received which includes a plurality of weather features. The 4D weather forecast data is processed using a chain of a plurality of processing blocks of a neural network to derive one or more of the plurality of weather features. Each of the plurality of processing blocks includes a convolutional layer, an activation layer, and a pooling layer. The convolution layer associates at least one filter to a region of the 4D weather forecast data across a plurality of layers in the 4D weather forecast data. A solar power forecast is determined for a predetermined location based upon the one or more derived weather features.Type: ApplicationFiled: February 2, 2017Publication date: August 2, 2018Applicant: International Business Machines CorporationInventors: Minwei Feng, Tarun Kumar, Rui Zhang, Wei Zhang
-
Publication number: 20180129969Abstract: A machine receives a first set of global parameters from a global parameter server. The first set of global parameters includes data that weights one or more operands used in an algorithm that models an entity type. Multiple learner processors in the machine execute the algorithm using the first set of global parameters and a mini-batch of data known to describe the entity type. The machine generates a consolidated set of gradients that describes a direction for the first set of global parameters in order to improve an accuracy of the algorithm in modeling the entity type when using the first set of global parameters and the mini-batch of data. The machine transmits the consolidated set of gradients to the global parameter server. The machine then receives a second set of global parameters from the global parameter server, where the second set of global parameters is a modification of the first set of global parameters based on the consolidated set of gradients.Type: ApplicationFiled: November 10, 2016Publication date: May 10, 2018Inventors: MINWEI FENG, YUFEI REN, YANDONG WANG, LI ZHANG, WEI ZHANG