Abstract: Machine learning architectures to perform pattern recognition such as a structurally regularized convolutional neural network architecture, along with corresponding methods of operation, are provided. One such architecture includes a memory, and a processor coupled to the memory, where the processor receives data including a pattern to be recognized, decomposes the data into of sub-bands, and processes each of the sub-bands with a respective convolutional neural network (CNN) to generate outputs, where each of the CNNs operates independently of the other CNNs. The processor aggregates the outputs of the CNNs, and trains, using the aggregated output, the CNNs to recognize the pattern.
Abstract: Configurable processors for implementing CNNs are provided. One such configurable CNN processor includes a plurality of core compute circuitry elements, each configured to perform a CNN function in accordance with a preselected dataflow graph, an active memory buffer, a plurality of connections between the active memory buffer and the plurality of core compute circuitry elements, each established in accordance with the preselected dataflow graph, a plurality of connections between the plurality of core compute circuitry elements, each established in accordance with the preselected dataflow graph, wherein the active memory buffer is configured to move data between the plurality of core compute circuitry elements via the active memory buffer in accordance with the preselected dataflow graph.
Abstract: Systems and methods for performing direct conversion of image sensor data to image analytics are provided. One such system for directly processing sensor image data includes a sensor configured to capture an image and generate corresponding image data in a raw Bayer format, and a convolution neural network (CNN) coupled to the sensor and configured to generate image analytics directly from the image data in the raw Bayer format. Systems and methods for training the CNN are provided, and may include a generative model that is configured to convert RGB images into estimated images in the raw Bayer format.
Abstract: Convolutional neural network compilers for programmable functional array processors are provided. One such compiler involves a method for fitting a convolutional neural network (CNN) to a CNN processor to be performed by a compiler, the method comprising: receiving a CNN; converting the CNN into a CNN graph; converting the CNN graph into a memory graph comprising graph primitives corresponding to a plurality of components of the CNN processor including a primary memory; performing a memory analysis to determine an amount of memory required in the primary memory for at least one of the graph primitives; identifying a plurality of tokens within the memory graph to form a token graph, each of the plurality of tokens comprising one or more of the graph primitives; and generating, using the plurality of identified tokens, configuration settings for each of the plurality of components of the CNN processor.