ARTWORK GENERATED TO CONVEY DIGITAL MESSAGES, AND METHODS/APPARATUSES FOR GENERATING SUCH ARTWORK

A neural network is applied to imagery including a machine readable code, to transform its appearance while maintaining its machine readability. One particular method includes training a neural network with a style image having various features. The trained network is then applied to an input pattern that encodes a plural-symbol payload. The network adapts features from the style image to express details of the input pattern, to thereby produce an output image in which features from the style image contribute to encoding of the plural-symbol payload. This output image can then be used as a graphical component in product packaging, such as a background, border, or pattern fill. In some embodiments, the input pattern is a watermark pattern, while in others it is a host image that has been previously watermarked. A great variety of other features and arrangements are also detailed.

Description
RELATED APPLICATION DATA

This application claims priority to provisional applications 62/745,219, filed Oct. 12, 2018, and 62/596,730, filed Dec. 8, 2017.

The subject matter of the present application is related to that of copending U.S. application Ser. No. 15/072,884, filed Mar. 17, 2016 (published as 20170024840), Ser. No. 16/002,989, filed Jun. 7, 2018, 62/682,731, filed Jun. 8, 2018, Ser. No. 16/129,487, filed Sep. 12, 2018, and 62/751,084, filed Oct. 26, 2018.

The disclosures of the above applications are incorporated herein by reference.

TECHNICAL FIELD

The present technology concerns message signaling. The technology is particularly illustrated in the context of message signaling through artwork of grocery item packaging, to convey plural-bit messages—of the sort presently conveyed by UPC barcodes. However, the technology is not so limited.

BACKGROUND AND INTRODUCTION

Barcodes are in widespread use on retail items, but take up real estate that the manufacturers would prefer to use for other purposes. Some retail brands find that printing a barcode on a product detracts from its aesthetics.

Steganographic digital watermarking is gaining adoption as an alternative to visible barcodes. Watermarking involves making subtle changes to packaging artwork to convey a multi-bit product identifier or other message. These changes are generally imperceptible to humans, but are detectable by a computer.

FIG. 1 shows an illustrative digital watermark pattern, including a magnified view depicting its somewhat mottled appearance. In actual use, the watermark pattern is scaled-down in amplitude so that, when overlaid with packaging artwork in a tiled fashion, it is an essentially transparent layer that is visually imperceptible amid the artwork details.

Designing watermarked packaging involves establishing a tradeoff between this amplitude factor, and detectability. To assure reliable detection—even under the adverse imaging conditions that are sometimes encountered by supermarket scanners—the watermark should have as strong an amplitude as possible. However, the greater the amplitude, the more apparent the pattern of the watermark becomes on the package. A best balance is struck when the watermark amplitude is raised to just below the point where the watermark pattern becomes visible to human viewers of the packaging.

FIG. 2 illustrates this graphically. When a watermark is added to artwork at low levels, human visibility of the watermark is nil. As strength of the watermark increases, there becomes a point at which humans can start perceiving the watermark pattern. Increases in watermark amplitude beyond this point result in progressively greater perception (e.g., from barely, to mild, to conspicuous, to blatant). Point “A” is the desired “sweet-spot” value, at which the amplitude of the watermark is maximum, while visibility of the watermark is still below the point of human perception.

Setting the watermark amplitude to this sweet-spot value, when creating packaging artwork (e.g., using Adobe Illustrator software), is one thing. Hitting this sweet-spot “on-press” is another.

All print technologies, being physical processes, have inherent uncertainty. Some print technologies have more uncertainty than others. Dry offset printing is one; it is notably inaccurate. (Dry offset is advantageous in other respects; for example, it works well with the tapered shapes of plastic tubs, such as for yogurt and sour cream.)

Dry offset offers only gross control of dot size and other print structures. For example, if a digital artwork file specifies that ink dots are to be laid down with a density of 15%, a dry offset press will typically deposit a much greater density of ink, e.g., with 30% density.

Printer profiles exist to characterize such behavior. A profile for a particular model of dry offset press may specify that artwork indicating a 15% density will actually be rendered with a 25% density (e.g., a 10% dot gain). But there is a great deal of variation between presses of the same model—depending on factors including age, maintenance, consumables, temperature, etc. So instead of depositing ink at a 25% density—as indicated by a printer's profile, a particular press may instead deposit ink at a 20% density. Or a 40% density. Or anything in between.

This uncertainty poses a big obstacle for use of digital watermark technology. Packaging artwork that has been carefully designed to set the watermark amplitude to the point “A” sweet-spot of FIG. 2, may instead be printed with the watermark amplitude at point “B”, making the watermark plainly visible.

Our patent publication 20110214044 teaches that, rather than attempt to hide a digital watermark signal in artwork, the payload data may be encoded as overt elements of the artwork. One example is a digital watermark in which the amplitude is set to a plainly human-visible level. Another example is a 1D or 2D black and white barcode that is used as a fill pattern, e.g., laid down by a paintbrush in Adobe Photoshop software.

These techniques often do not prove satisfactory. As illustrated by FIG. 1, a digital watermark, with its amplitude set to a plainly human-visible level, yields a pattern that does not fit well into the design of most packaging. Painted barcode fills similarly have limited practical utility.

In accordance with one aspect of the technology, a neural network is applied to imagery including a machine readable code, to transform its appearance while maintaining its machine readability. One particular method includes training a neural network with a style image having various features. The trained network is then applied to an input pattern that encodes a plural-symbol payload. The network adapts features from the style image to express details of the input pattern, to thereby produce an output image in which features from the style image contribute to encoding of the plural-symbol payload. This output image can then be used as a graphical component in product packaging, such as a background, border, or pattern fill. In some embodiments, the input pattern is a watermark pattern, while in others it is a host image that has been previously watermarked.

The foregoing and additional features and advantages of the present technology will be more readily apparent from the following detailed description, which proceeds with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

FIG. 1 is an illustration of a digital watermark pattern.

FIG. 2 is a graph showing how a digital watermark is human-imperceptible at low strengths, but becomes progressively more conspicuous at higher strengths.

FIG. 3 is a diagram depicting an algorithm for generating a digital watermark tile.

FIG. 4 details an illustrative neural network that can be used in embodiments of the present technology.

FIGS. 5 and 6 further detail use of the FIG. 4 neural network in an illustrative embodiment of the technology.

FIG. 7 details an algorithm employed in one illustrative embodiment of the technology.

FIG. 8 shows an obfuscated code signal produced by the present technology from the digital watermark pattern of FIG. 1 and a (depicted) bitonal geometrical pattern.

FIG. 9 shows another obfuscated code signal produced by the present technology from the digital watermark pattern of FIG. 1 and a (depicted) bitonal geometrical pattern.

FIG. 10 shows still another obfuscated code signal produced by the present technology from the digital watermark pattern of FIG. 1 and still another (depicted) bitonal geometrical pattern.

FIG. 11 shows a line art counterpart to the image of FIG. 10, produced by the method of FIG. 12.

FIG. 12 details an algorithm for further-processing an image produced with the present technology.

FIG. 13 shows another pattern produced by methods of the present technology.

FIG. 14A depicts a user interface of software that generates a watermark pattern with user-defined parameters, and enables an artist to specify a style to be applied to the pattern.

FIGS. 14B and 14C detail exemplary methods employed by the software of FIG. 14A.

FIGS. 14D, 14E, 14F and 14G are images on which the “Candy,” “Mosaic,” “Feathers,” and “La Muse” styles visible in FIG. 14A are based.

FIGS. 15A-15ZZ depict continuous tone watermark patterns that have been processed in accordance with aspects of the present technology.

FIGS. 16A-16C show how stylization of a watermark pattern serves to scale, translate, and rotate features of a style image to mimic, or echo, features of the watermark pattern.

FIGS. 17A-17E illustrate how three patterns can be combined: a host image, a watermark pattern, and a style image.

FIGS. 18A-18C illustrate the Lenna image styled with images of candy balls, denim, and grass, respectively.

FIGS. 19A-19C illustrate that a sparse mark, rendered as a Delaunay triangulation, can be stylized with a pattern of leaves, resulting in an output image in which the leaves are morphed in scale, location and rotation to correspond to triangles of the Delaunay pattern.

FIG. 20 depicts an inverted transform method according to one aspect of the technology.

FIGS. 21A-21E depict application of an inverted transform method in connection with a Voronoi pattern.

FIG. 22 depicts an inverted transform method according to one aspect of the technology.

FIGS. 23A and 23B show a Voronoi pattern based on a sparse pattern of dots, and the same pattern after application of the inverted transform method of FIG. 22.

FIGS. 24A and 24B are enlargements from FIGS. 23A and 23B.

FIGS. 25A and 25B illustrate how artistic signaling patterns according to the present technology can be used as background patterns on a boxed item of food.

FIGS. 26 and 27 are like FIGS. 25A and 25B, but showing application of artistic signaling patterns to food items in cans and plastic tubs.

FIGS. 28A and 28B illustrate use of artistic signaling patterns, based on continuous tone watermarks, as graphic backgrounds on a grocery item.

FIGS. 29A-29C illustrate use of artistic signaling patterns, based on sparse watermarks, as graphic backgrounds on a grocery item.

FIGS. 30A and 30B show polygon-based patterns based on continuous tone watermarks.

FIG. 31A shows an artistic signaling pattern generated from a sparse mark using various image transformations.

FIG. 31B is an enlargement from FIG. 31A.

FIGS. 32A, 32B, 33A, 33B, 34A, 34B, 35A, 35B, 36A, 36B, 37A, 37B, 38A and 38B are similar to FIGS. 31A and 31B, but using different image transformations.

FIGS. 39A and 40A show artistic signaling patterns generated from a continuous tone watermark using various image transformations, and FIGS. 39B and 40B are enlargements thereof.

FIG. 41 shows a colored artistic signaling pattern produced from a sparse mark.

FIG. 42A shows an excerpt from a watermarked poster, and FIG. 42B shows an enlargement from FIG. 42A.

FIGS. 43 and 44 show excerpts of tiles of signal-carrying patterns.

FIG. 45 shows how glyphs hand-written on a tablet can employ a signal-carrying pattern.

FIG. 46 shows how strokes on a tablet can have directional texture associated with the brush, and/or its direction of use, while also employing a signal-carrying pattern.

FIG. 47 conceptually-illustrates how two overlapping brush strokes may effect a darkening of a common region, while maintaining two directional textures.

FIG. 48 conceptually-illustrates how two overlapping brush strokes that apply a signal-carrying pattern may effect a darkening of a common region by changing mean value of the pattern.

FIG. 49A shows that the mean value of patterning in a rendering of handwritten glyphs can depend on stylus-applied pressure.

FIG. 49B is like FIG. 49A but with the patterning less-exaggerated in contrast.

FIG. 49C is like FIG. 49B, but is presented at a more typical scale.

FIGS. 50A and 50B show how handwritten glyphs entered by a stylus can be patterned with different patterns, including a neural network-derived signal-carrying pattern.

FIG. 51 shows the user interface of a prior art tablet drawing program.

FIG. 52 shows correspondence between a signal-carrying pattern, and handwritten strokes of different pressures—causing the pattern to be rendered with different mean values.

DETAILED DESCRIPTION

FIG. 3 shows an illustrative direct sequence spread spectrum watermark generator. A plural-symbol message payload (e.g., 40 binary bits, which may represent a product's Global Trade Identification Number, or GTIN) is applied to an error correction coder. This coder transforms the symbols of the message payload into a much longer array of encoded message elements (e.g., binary or M-ary elements) using an error correction method. (Suitable coding methods include block codes, BCH, Reed Solomon, convolutional codes, turbo codes, etc.) The coder output may comprise hundreds or thousands of binary bits, e.g., 4,096, which may be termed “raw bits.”

These raw bits each modulate a pseudorandom noise (carrier) sequence of length 16, e.g., by XORing. Each raw bit thus yields a 16-bit modulated carrier sequence, for an enlarged payload sequence of 65,536 elements. This sequence is mapped to elements of a square block having 256×256 embedding locations. These locations correspond to pixel samples at a configurable spatial resolution, such as 100 or 300 dots per inch (DPI). (In a 300 DPI embodiment, a 256×256 block corresponds to an area of about 0.85 inches square.) Such blocks can be tiled edge-to-edge for printing on the paper or plastic of a product package/label.
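By way of illustration, the spreading and mapping just described can be sketched in Python as follows. The pseudorandom carrier and the mapping permutation shown are illustrative placeholders seeded from an arbitrary value, not the actual keys used by a compliant encoder.

```python
import numpy as np

def spread_and_map(raw_bits, block_size=256, carrier_len=16, seed=42):
    """Modulate raw bits onto a PN carrier and map them to a square block.

    raw_bits: array of 4,096 values in {0, 1} from the error correction coder.
    Returns a block_size x block_size array of 0/1 chip values.
    """
    raw_bits = np.asarray(raw_bits)
    assert raw_bits.size * carrier_len == block_size ** 2

    rng = np.random.default_rng(seed)            # stand-in for the shared PN generator
    carrier = rng.integers(0, 2, size=carrier_len)

    # XOR each raw bit with the 16-chip carrier: 4,096 x 16 = 65,536 chips
    chips = np.bitwise_xor(raw_bits[:, None], carrier[None, :]).ravel()

    # Pseudo-randomly scatter the chips over the 256x256 embedding locations
    perm = rng.permutation(block_size * block_size)
    block = np.empty(block_size * block_size, dtype=np.uint8)
    block[perm] = chips
    return block.reshape(block_size, block_size)
```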

There are several alternatives for mapping functions to map the enlarged payload sequence to embedding locations. In one, the sequence is pseudo-randomly mapped to the embedding locations. In another, its elements are mapped to bit cell patterns of differentially encoded bit cells as described in published patent application 20160217547. In the latter, the block size may be increased to accommodate the differential encoding of each encoded bit in a pattern of differential encoded bit cells, where the bit cells correspond to embedding locations at a target resolution (e.g., 300 DPI).

The enlarged payload sequence of 65,536 elements is a bimodal signal, e.g., with values of 0 and 1. It is mapped to a bimodal signal of larger amplitude, centered at 128, e.g., with values of 95 and 161.

A synchronization signal is commonly included in a digital watermark, to help discern parameters of any affine transform to which the watermark may have been subjected prior to decoding (e.g., by image capture with a camera having an oblique view of the pattern), so that the payload can be correctly decoded. A particular synchronization signal (sometimes termed a calibration signal, or registration signal, or grid signal) comprises a set of dozens of magnitude peaks or sinusoids of pseudorandom phase, in the Fourier domain. This signal is transformed to the spatial domain in a 256×256 block size (e.g., by an inverse Fast Fourier transform), corresponding to the 256×256 embedding locations to which the enlarged payload sequence is mapped. (Spatial domain values larger than a threshold value, e.g., 40, are clipped to that threshold, and likewise with values smaller than a negative threshold value, e.g., −40.)

This spatial domain synchronization signal is summed with the block-mapped 256×256 payload sequence to yield a final watermark signal block.
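A minimal sketch of generating such a synchronization block and summing it with the mapped payload follows. The peak locations, peak count, and amplitude scaling are illustrative; an actual registration signal uses a fixed, known set of spatial frequencies.

```python
import numpy as np

def make_sync_block(size=256, n_peaks=60, clip=40, seed=7):
    """Spatial-domain synchronization block from Fourier peaks of pseudorandom phase."""
    rng = np.random.default_rng(seed)
    spectrum = np.zeros((size, size), dtype=complex)
    # Illustrative peak locations at mid frequencies (a real grid signal uses known peaks)
    rows = rng.integers(10, size // 2, n_peaks)
    cols = rng.integers(10, size // 2, n_peaks)
    spectrum[rows, cols] = np.exp(1j * rng.uniform(0, 2 * np.pi, n_peaks))

    sync = np.real(np.fft.ifft2(spectrum))       # back to the spatial domain
    sync *= 60 / np.abs(sync).max()              # illustrative amplitude scaling
    return np.clip(sync, -clip, clip)            # clip to +/-40 as described above

# payload_block: 256x256 bimodal signal with values 95 and 161 (centered at 128)
# final_block = np.clip(payload_block + make_sync_block(), 0, 255)
```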

In another embodiment, a block of size 128×128 is generated, e.g., with a coder output of 1024 bits, and a payload sequence that maps each of these bits 16 times to 16,384 embedding locations.

The just-described watermark signal may be termed a “continuous tone” watermark signal. It is usually characterized by multi-valued data, i.e., not being just on/off (or 1/0, or black/white)—thus the “continuous” moniker. Each pixel of the host image (or of a region within the host image) is associated with one corresponding element of the watermark signal. A majority of the pixels in the image (or image region) are changed in value by combination with their corresponding watermark elements. The changes are typically both positive and negative, e.g., changing the local luminance of the imagery up in one location, while changing it down in another. And the changes may be different in degree—some pixels are changed a relatively smaller amount, while other pixels are changed a relatively larger amount. Typically the amplitude of the watermark signal is low enough that its presence within the image escapes notice by casual viewers (i.e., it is steganographic).

In a variant continuous tone watermark, the signal acts not to change the local luminance of artwork pixels, but rather their color. Such a watermark is termed a “chrominance” watermark (instead of a “luminance” watermark). An example is detailed, e.g., in U.S. Pat. No. 9,245,308, the disclosure of which is incorporated herein by reference.

“Sparse” or “binary” watermarks are different from continuous tone watermarks in several respects. They do not change a majority of pixel values in the host image (or image region). Rather, they have a print density (which may sometimes be set by the user) that results in marking between 5% and 45% of pixel locations in the image. Adjustments are all made in the same direction, e.g., reducing luminance. Sparse elements are typically bitonal, e.g., being either white or black. Although sparse watermarks may be formed on top of other imagery, they are usually presented in regions of artwork that are blank, or colored with a uniform tone. In such cases a sparse marking may contrast with its background, rendering the marking visible to casual viewers. As with continuous tone watermarks, sparse watermarks generally take the form of signal blocks that are tiled across an area of imagery.

Exemplary sparse marking technology is detailed in the patent documents referenced at the beginning of this specification.

A sparse pattern can be rendered in various forms. Most straight-forward is as a seemingly-random pattern of dots. But more artistic renderings are possible, including Voronoi, Delaunay, and stipple half-toning. Such techniques are detailed in above-cited application 62/682,731. (In a simple Voronoi pattern, a line segment is formed between a dot and each of its neighboring dots, for all the dots in the input pattern. Each line segment extends until it intersects with another such line segment. Each dot is thus surrounded by a polygon.)
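For example, a Voronoi rendering of a sparse dot pattern can be sketched with SciPy as follows. The random dots here merely stand in for an actual sparse mark's signal dots.

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.spatial import Voronoi, voronoi_plot_2d

# Stand-in sparse mark: a pseudo-random pattern of dots (an actual mark encodes a payload)
rng = np.random.default_rng(1)
dots = rng.uniform(0, 256, size=(400, 2))

vor = Voronoi(dots)                               # a polygon cell forms around every dot
voronoi_plot_2d(vor, show_points=False, show_vertices=False, line_width=0.8)
plt.axis("equal"); plt.axis("off")
plt.savefig("voronoi_fill.png", dpi=300, bbox_inches="tight")
```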

Unless otherwise indicated, the techniques described below can be applied to any type of watermarks—continuous tone luminance, continuous tone chrominance, sparse dot, Voronoi, Delaunay, stipple, etc., as well as to images incorporating such watermarks.

First Exemplary Embodiment

A first exemplary embodiment of the present technology processes a watermark signal block, such as described above, with a neural network. The network can be of various designs. One, shown in FIG. 4, is based on the VGG network described in Simonyan et al, Very Deep Convolutional Networks for Large-Scale Image Recognition, arXiv preprint 1409.1556v6, Apr. 10, 2015 (familiar to those skilled in that art, and attached to application 62/596,730), but lacking its three fully-connected layers, and its SoftMax classifier.

The network of FIG. 4 is characterized by five stacks of convolutional layers, each based on 3×3 receptive fields. These 3×3 filters are convolved with their inputs at every pixel, i.e., with a stride of 1. The first stack includes two cascaded convolutional layers, of 64 channels each. The second stack again includes two cascaded convolutional layers, but of 128 channels each. The third stack includes four cascaded convolutional layers, of 256 channels each. The fourth and fifth stacks include four cascaded convolutional layers, of 512 channels each. Each layer includes a Rectified Linear Unit (ReLu) stage to zero any negative values in the convolutional outputs. Each stack additionally includes a concluding average pooling layer (rather than the max pooling layer taught by Simonyan), which is indicated in FIG. 4 by the shaded layer at the right of each stack. The input image is an RGB image of size 224×224 pixels.
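A rough PyTorch approximation of this network is sketched below, assuming the 16-convolutional-layer (VGG-19-style) feature stack with ImageNet-pretrained weights, no fully-connected layers or classifier, and max pooling swapped for average pooling as described above. The helper returns the filter responses of selected layers and is reused in later sketches.

```python
import torch
import torch.nn as nn
import torchvision.models as models

# Conv stacks only (no fully-connected layers, no SoftMax classifier),
# with max pooling layers replaced by average pooling.
vgg = models.vgg19(weights=models.VGG19_Weights.IMAGENET1K_V1).features.eval()
for p in vgg.parameters():
    p.requires_grad_(False)
for i, layer in enumerate(vgg):
    if isinstance(layer, nn.MaxPool2d):
        vgg[i] = nn.AvgPool2d(kernel_size=2, stride=2)

def layer_activations(img, layers):
    """Filter responses at the requested layer indices, for a 1x3x224x224 input."""
    acts, x = {}, img
    for i, layer in enumerate(vgg):
        x = layer(x)
        if i in layers:
            acts[i] = x
    return acts
```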

In its original conception, the VGG network was designed to identify images, e.g., to determine into which of several classes (dog, cat, car, etc.) an image belongs. And the network here is configured for this purpose—employing filter kernels that have been trained to classify an input image into one of the one thousand classes of the ImageNet collection. However, the network is here used for a different purpose. It is here used to discern “features” that help characterize a watermark image. And it is used again to discern features that characterize a geometrical pattern image. And it is used a third time to characterize features of a test image (which may be a frame of pseudo-random noise). This test image is then adjusted to drive its features to more closely correspond both to features of the watermark image and to features of the geometrical pattern image. The adjusted test image is then re-submitted to the network, and its features are re-determined. Adjustment of the test image continues in an iterative fashion until a final form is reached that incorporates geometrical attributes of the geometrical pattern image, while also being decodable by a digital watermark decoder.

FIGS. 5 and 6 help further explain this process. A watermark signal block (A) is input to the network. Each of the stacks of convolutional layers produces filter responses, determined by applying the convolutional parameters of their 3×3 filters on respective excerpts of input data. These filter responses are stored as reference features for the watermark signal block, and will later serve as features towards which features of the third image are to be steered.

A similar operation is applied to the geometrical pattern image (B). The geometrical pattern image is applied to the network, and each of the stacks produces corresponding filter responses. In this case, however, these filter responses are processed to determine correlations between different filter channels of a layer. This correlation data, which is further detailed below, serves as feature data indicating the style, or texture, of the geometrical pattern image. This correlation data is stored, and is later used as a target towards which similarly-derived correlation data from the test image is to be steered.

A test image (C) is next applied to the network. The starting test image can be of various forms. Exemplary is a noise signal. Each of the stacks of convolutional layers produces filter responses. Correlations between such filter responses—between channels of layers—are also determined. The test image is then adjusted, so that its filter responses more closely approximate those of the watermark signal block, and so that its texture-indicating correlation data more closely approximates that of the geometrical pattern block.

Diving more deeply, denote the watermark image applied to the network as $\vec{w}$. This image can be characterized by the network's filter responses to it. A layer $l$ in the first stack of FIG. 5, with 64 ($N_l$) channels, has 64 distinct feature maps, each of size $M_l$, where $M_l$ is the number of elements in the feature map (width times height, or 222²). A matrix $W^l$, of dimensions 64×222², can thus be defined for each layer in the network, made up of the 64 filter responses for each of the 222² positions in the output from layer $l$. $W^l_{ij}$ defines the activation of the ith filter at position $j$ in this layer.

If a test image $\vec{x}$ is applied to the network, a different matrix is similarly defined by its filter responses. Call this matrix $X^l$. A loss function can then be defined based on differences between these filter response matrices. A squared-loss error is commonly used, and may be computed, for a given layer $l$, as


$$\mathrm{Loss}_W(\vec{w}, \vec{x}, l) = \tfrac{1}{2} \sum_{i,j} \left( X^l_{ij} - W^l_{ij} \right)^2$$

Taking the derivative of this function indicates the directions in which the filter responses should change, and the relative magnitudes of such changes:

$$\frac{\partial\, \mathrm{Loss}_W}{\partial X^l_{ij}} = \begin{cases} \left( X^l - W^l \right)_{ij} & \text{if } X^l_{ij} > 0 \\ 0 & \text{if } X^l_{ij} < 0 \end{cases}$$

This derivative establishes the gradients used to adjust the test image using back-propagation gradient descent methods. By iteratively applying this procedure, the test image can be morphed until it generates the same filter responses, in a particular layer of the network, as the watermark image $\vec{w}$.

The geometrical pattern image can be denoted as $\vec{g}$. Rather than use filter responses, per se, as features of this image, a variant feature space is built, based on correlations between filter responses, using a Gram matrix.

As is familiar to artisans in such technology, and as detailed in a Wikipedia article entitled Gramian Matrix (attached to application 62/596,730), a Gram matrix is composed of elements that are inner products of vectors in a Euclidean space. In the present case the vectors involved are the different filter responses resulting when the geometrical pattern is applied to the network.

Consistent with the notation used above for the watermark pattern image $\vec{w}$ and the test image $\vec{x}$, the filter responses for a given layer $l$ of the geometrical pattern image $\vec{g}$ are expressed as the matrix $G^l_{ij}$, where $i$ denotes one of the $N$ filter channels (e.g., one of the 64 filter channels for the first layer), and $j$ denotes the spatial position (from among the 222² positions for the first layer). The correlation-indicating Gram matrix $C$ for a particular layer of the geometrical pattern image $\vec{g}$ is an $N \times N$ matrix:

$$C^l_{ij} = \sum_k G^l_{ik}\, G^l_{jk}$$

That is, the Gram matrix is the inner product between the vectorized filter responses $i$ and $j$ in layer $l$, when the geometrical pattern image $\vec{g}$ is applied to the network (i.e., the inner product between intermediate activations—a tensor operation that results in an $N \times N$ matrix for each layer of $N$ channels).
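In code form, such a Gram matrix may be computed as sketched below, assuming one layer's activations are available as a (channels, height, width) tensor (e.g., from the layer_activations helper sketched earlier).

```python
import torch

def gram_matrix(activ):
    """Channel-by-channel correlation (Gram) matrix for one layer's filter responses.

    activ: tensor of shape (N_channels, H, W).
    Returns an N x N matrix C with C[i, j] = sum_k G[i, k] * G[j, k].
    """
    n, h, w = activ.shape
    g = activ.reshape(n, h * w)        # vectorize each filter response
    return g @ g.t()                   # inner products between all channel pairs
```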

This procedure is again applied twice: once on the geometrical pattern image, and once on the test image. Two feature correlations result, as indicated by the two Gram matrices. Gradient descent is then applied to the test image to minimize a mean-squared difference between elements of the two Gram matrices.

If $C^l_{ij}$, as defined above, is the Gram matrix for the geometrical pattern image, and $D^l_{ij}$ is the corresponding Gram matrix for the test image, then the loss $E_l$ associated with layer $l$ for the two images is:

$$E_l = \frac{1}{4 N_l^2 M_l^2} \sum_{i,j} \left( C^l_{ij} - D^l_{ij} \right)^2$$

The total loss function, across plural layers $L$, between the geometrical pattern image $\vec{g}$ and the test image $\vec{x}$ is then

$$\mathrm{Loss}_G(\vec{g}, \vec{x}) = \sum_{l=0}^{L} a_l E_l$$

where $a_l$ are weighting factors for the contribution of each layer to the total loss.

The derivative of $E_l$ with respect to the filter responses in that layer is expressed as:

$$\frac{\partial E_l}{\partial X^l_{ij}} = \begin{cases} \dfrac{1}{4 N_l^2 M_l^2} \left( (X^l)^{\mathsf{T}} \left( C^l - D^l \right) \right)_{ji} & \text{if } X^l_{ij} > 0 \\ 0 & \text{if } X^l_{ij} < 0 \end{cases}$$

Again, standard error back-propagation can be applied to determine the gradients of El, which can be used to iteratively adjust the test image, to bring its correlation features (Gram matrix) closer to that of the geometrical pattern image.

While the two loss functions defined above, LossW and LossG, can be used to adjust the test image to more closely approximate the watermark block image, and the geometrical pattern image, respectively, the loss functions are desirably applied jointly, in a weighted fashion. That is:


$$\mathrm{Loss}_{TOTAL} = \alpha\, \mathrm{Loss}_W + \beta\, \mathrm{Loss}_G$$

where α and β are weighting factors for the watermark pattern and the geometrical pattern, respectively.

The derivative of this combined loss function, with respect to the pixel values of the test image $\vec{x}$, is then used to drive the back-propagation adjustment process. An optimization routine, L-BFGS-B, can optionally be applied. (Cf. Zhu et al, Algorithm 778: L-BFGS-B: Fortran Subroutines for Large-Scale Bound-Constrained Optimization, ACM Trans. on Mathematical Software, 23(4), pp. 550-560, 1997.)

In a particular embodiment, the loss factor LossG is based on filter responses from many of the 16 different convolution layers in the network—in some cases all of the layers. In contrast, the loss factor LossW is typically based on filter responses from a small number of layers—in some cases just one (e.g., the final layer in stack #4).
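Drawing these pieces together, the following sketch uses the layer_activations and gram_matrix helpers shown earlier to iteratively adjust a noise frame toward both targets. The layer indices, loss weights, and optimizer settings are illustrative choices, not the parameters of any deployed encoder.

```python
import torch

def stylize_watermark(wm_img, style_img, n_iters=200, alpha=1.0, beta=1e3,
                      content_layers=(21,), style_layers=(0, 5, 10, 19, 28)):
    """Iteratively adjust a test image toward the watermark's filter responses
    and the style image's Gram matrices (layer indices are illustrative)."""
    wm_feats = {l: a.detach() for l, a in layer_activations(wm_img, content_layers).items()}
    style_grams = {l: gram_matrix(a[0]).detach()
                   for l, a in layer_activations(style_img, style_layers).items()}

    x = torch.rand_like(wm_img, requires_grad=True)      # starting test image: noise frame
    opt = torch.optim.LBFGS([x], max_iter=n_iters)

    def closure():
        opt.zero_grad()
        acts = layer_activations(x, set(content_layers) | set(style_layers))
        loss_w = sum(((acts[l] - wm_feats[l]) ** 2).sum() / 2 for l in content_layers)
        loss_g = sum(((gram_matrix(acts[l][0]) - style_grams[l]) ** 2).mean()
                     for l in style_layers)
        loss = alpha * loss_w + beta * loss_g            # joint, weighted loss
        loss.backward()
        return loss

    opt.step(closure)
    return x.detach()
```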

The foregoing discussion draws heavily from Gatys et al, Image Style Transfer Using Convolutional Neural Networks, Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2414-2423, 2016, and its predecessor Gatys, et al, A Neural Algorithm of Artistic Style. arXiv preprint arXiv:1508.06576, Aug. 26, 2015 (both familiar to artisans in that field, and both attached to application 62/596,730), which provide additional details. Those papers, however, particularly concern transferring artistic styles between digitized paintings and photographs, and do not contemplate machine readable codes (nor geometrical patterns).

The Gatys work is reviewed and expanded in Johnson, et al, Perceptual Losses for Real-Time Style Transfer and Super-Resolution, European Conference on Computer Vision, Oct. 8, 2016, pp. 694-711, and in Nikulin, et al, Exploring the Neural Algorithm of Artistic Style, arXiv preprint arXiv:1602.07188, Feb. 23, 2016 (both are again familiar to artisans in that field, and both are attached to application 62/596,730; the former method is sometimes termed “the method of Johnson” or “the Johnson method” herein).

The procedure detailed above is applied to iteratively adjust the test image until it adopts patterning from the geometrical pattern image to a desired degree, e.g., to a degree sufficient to obfuscate the mottled patterning of the watermark block—while still maintaining the machine readability of the watermark block.

FIG. 7 is a flow chart detailing aspects of the foregoing method.

FIG. 8 shows an illustrative application of the method. A bitonal geometrical pattern of the same size as the earlier-defined watermark signal block was created to serve as the style image, comprising concentric black and white circles. Its Gram matrix was computed, for the 16 layers in the FIG. 4 network. After 20 iterations, the above-detailed procedure had updated an input random image to the form shown on the right.

It will be recognized that this pattern shown on the right of FIG. 8 is more suitable for incorporating into packaging artwork than the pattern shown on the right of FIG. 1.

FIG. 9 shows a similar example, this one based on a bitonal checkerboard pattern.

FIG. 10 shows another example, this one based on a pattern of tiled concentric squares.

As noted in the Background discussion, some printing technologies have substantial variability. Dry offset was particularly mentioned. When printing a press run of product packaging, this variability can cause a digital watermark pattern, which is intended to be rendered with an ink coverage resulting in human imperceptibility, to pop into conspicuous visibility.

The patterns of FIGS. 8, 9 and 10 are robust to such printing errors. If more or less ink is applied due to press variability, the package appearance will change somewhat. But it will not cause a pattern that is meant to be concealed, to become visible. And the encoded signal is strong—far to the right on FIG. 2, and far stronger than the strengths (e.g., “A” to “B”) at which watermarks intended for imperceptibility are applied. Supermarket scanners thus more reliably decode the product identifying information (e.g., GTINs) from such printing, even if only a fraction of the pattern is depicted in an image frame, or if contrast or glare issues would impair reading of an imperceptible watermark.

The patterns on the right of FIGS. 8-10 can be further processed to form line art images. FIG. 11 shows an example. Such line art images are even more robust to press variations, and extend application of this technology to other forms of printing that are commonly based on line structures (e.g., intaglio).

The FIG. 11 image is derived from the FIG. 10 (right side) image by first binarizing the image against a threshold value, e.g., of 128 (i.e., setting all pixels having a value above 128 to a value of 255, and setting all other pixels to a value of 0). A skeletonizing operation (e.g., a medial axis filter) is then applied to the binarized image. The “skeleton” thereby produced is then dilated, by a dilation filter (e.g., replacing each pixel in the skeleton with a round pixelated dot, e.g., of diameter 2-20). Such a procedure is illustrated by the flow chart of FIG. 12. (Additional information on such procedure is detailed in U.S. application Ser. No. 16/129,487.)
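A minimal sketch of this binarize/skeletonize/dilate procedure, using scikit-image, follows. The threshold and dot radius are illustrative.

```python
import numpy as np
from skimage.morphology import medial_axis, dilation, disk

def to_line_art(img, threshold=128, dot_radius=3):
    """Binarize, skeletonize, then dilate, per the FIG. 12 procedure.

    img: 2-D uint8 array (e.g., the stylized pattern of FIG. 10).
    dot_radius: radius of the round dot used to thicken the skeleton (illustrative).
    """
    binary = img > threshold                      # pixels above the threshold become white
    skeleton = medial_axis(binary)                # medial axis filtering of the binary image
    lines = dilation(skeleton, disk(dot_radius))  # replace each skeleton pixel with a dot
    return (lines * 255).astype(np.uint8)
```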

In a variant embodiment (i.e., not strictly line art), the FIG. 10 image is simply binarized. Such an image is shown in FIG. 13.

Through this process, decodability of the encoded payload should persist. Accordingly, at the end of any image processing operation—and optionally at interim steps (e.g., between iterations)—the test image may be checked to confirm that the watermark is still decodable. A watermark signal strength metric can be derived as detailed, e.g., in U.S. Pat. No. 9,690,967, and in application Ser. No. 16/129,487, filed Sep. 12, 2018, Ser. No. 15/816,098, filed Nov. 17, 2017, and Ser. No. 15/918,924, filed Mar. 12, 2018, the disclosures of which are incorporated herein by reference.

In the cited binarization operations, different thresholds may be tested, and strength metrics for the watermark signal can be determined for each. In choosing between candidate thresholds that all yield suitable patterning for use in a particular package, the threshold that yields the strongest watermark signal may be selected.
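Such a threshold sweep can be sketched as below; measure_strength is a placeholder for a detector-side strength metric such as those detailed in the documents cited above, not an implementation of one.

```python
def best_threshold(img, measure_strength, candidates=(96, 112, 128, 144, 160)):
    """Binarize at each candidate threshold; keep the one whose result scores
    strongest under the supplied watermark strength metric (a placeholder here)."""
    scored = []
    for t in candidates:
        binary = (img > t).astype('uint8') * 255
        scored.append((measure_strength(binary), t, binary))
    strength, threshold, binary = max(scored, key=lambda s: s[0])
    return threshold, binary
```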

Similarly, when using a neural network to morph a test image to take on attributes of a geometrical pattern, the geometrical pattern may come to dominate the morphed test image—depending on the weighting parameters α and β, and the number of iterations. Here, too, periodically assessing the strength of the watermark signal can inform decisions such as suitable weights for α and β, and when to conclude the iterative process.

Second Exemplary Embodiment

A second exemplary embodiment is conceptually similar to the first exemplary embodiment, so the teachings detailed above are generally applicable. In the interest of conciseness, such common aspects are not repeated here.

This second embodiment is hundreds or thousands of times faster than the first embodiment in applying a particular style to an image—requiring only a forward pass through the network. However, the network must first be pre-trained to apply this particular style, and the training process is relatively lengthy (e.g., four hours on a capable GPU machine).

An advantage of this embodiment for our purposes is that, once trained, the network can quickly apply the style to any watermark (or watermarked) input image. This method is thus suitable for use in a tool for graphic artists, by which they can apply one of dozens or hundreds of styles (for which training has been performed in advance) to an input pattern, e.g., to generate a signal-carrying pattern for incorporation into their artwork.

FIG. 14A is a screenshot of the user interface 140 of such a software tool. A palette of pre-trained neural styles is on the left, such as style 141 derived from Edvard Munch's “The Scream.” (Also included can be some algorithmic patterns 142, such as Voronoi, Delaunay and stipples, as referenced above.) The user clicks a button to select a desired style. In the upper right are dialog features 143 by which the user can specify attributes of a watermark pattern. That is, in this particular example, a watermark pattern is not input, per se. Rather, parameters for a watermark pattern are specified through the user interface, and the watermark pattern is generated on the fly, as part of operation of the software. (A sparse dot marking may be generated if an algorithmic pattern, such as Voronoi, is selected. A continuous tone watermark may be generated if one of the other styles is selected.) The selected style is then applied to the specified watermark.

In particular, the user enters parameters specifying the payload of the watermark signal (e.g., a 12-digit GTIN code), and the resolution of the desired watermark signal block in watermarks per inch (WPI). The user also specifies the physical scale of the watermark signal block (in inches), and the strength ("Durability") with which the watermark signal is to be included in the finished, stylized block. An analysis tool, termed Ratatouille, can be selectively invoked to perform a signal strength analysis of the resulting stylized block, in accordance with teachings of the patent documents referenced earlier. A moment after the parameters are input (typically well less than ten seconds), a stylized, watermarked signal block is created, and stored in an output directory with a file name that indicates metadata about its creation, such as the date, time, style, and GTIN. A rendering 144 of the signal block is also presented in the UI, for user review.

FIGS. 14B and 14C further detail aspects of the software's operation.

The pre-training to generate the neural styles 141 of FIG. 14A proceeds according to the method of Johnson, referenced earlier. As noted, this method is familiar to artisans; it is briefly reviewed in a separate section, below. His paper has been cited over a thousand times in subsequent work. His method has been implemented in various freely-accessible forms. One is an archive of software code written by author Justin Johnson, available at github<dot>com/jcjohnson/fast-neural-style. (The <dot> convention is used in lieu of a period, to assure there is no browser-executable code in this specification, per MPEP § 608.) Another is an archive of code written by Logan Engstrom, available at github<dot>com/lengstrom/fast-style-transfer. Still another is code written by Adrian Rosebrock, using functionality of the open source library OpenCV (e.g., version 3.4.1), detailed at www<dot>pyimagesearch<dot>com/2018/08/27/neural-style-transfer-with-opencv/. The Web Archive (web<dot>archive<dot>org) has backups of all these materials.

Appendices A, B and C to application 62/745,219 form part of this specification and present material from the public github web site—including source code—for the method of Johnson, and for its implementation by Engstrom and Rosebrock, respectively. Such code can use previously compiled style models, e.g., in a binary Torch format (*.t7). Sample t7 models are freely available from an archived Justin Johnson page from a Stanford web server, web<dot>archive<dot>org/web/20170917122404/http://cs.stanford.edu:80/people/jcjohns/fast-neural-style/models/instance_norm/. Alternatively, style models can be newly-created.
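By way of illustration, a previously compiled .t7 style model can be applied to a watermark pattern image with OpenCV's dnn module, following the Rosebrock route referenced above. The file names below are illustrative placeholders.

```python
import cv2
import numpy as np

# Apply a pre-trained fast-style-transfer model (Torch .t7 format) to a watermark block.
net = cv2.dnn.readNetFromTorch("models/mosaic.t7")     # illustrative model path
img = cv2.imread("watermark_block.png")                # illustrative input image
h, w = img.shape[:2]

blob = cv2.dnn.blobFromImage(img, 1.0, (w, h),
                             (103.939, 116.779, 123.680), swapRB=False, crop=False)
net.setInput(blob)
out = net.forward()                                     # shape: (1, 3, h, w)

out = out.reshape(3, out.shape[2], out.shape[3])
out[0] += 103.939; out[1] += 116.779; out[2] += 123.680  # undo mean subtraction
stylized = np.clip(out.transpose(1, 2, 0), 0, 255).astype("uint8")
cv2.imwrite("stylized_block.png", stylized)
```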

FIGS. 14D, 14E, 14F and 14G show the original images that were used as style images in the method of Johnson, to define the “Candy,” “Mosaic,” “Feathers,” and “La Muse” styles—each of which is a selectable option for applying to a watermark pattern, in the user interface of FIG. 14A.

FIG. 15A shows a result of the method, when a network using the method of Johnson is trained to apply a style derived from a photograph of rice grains, and this style is applied to a watermark pattern that encodes a particular plural-bit payload (in this case a continuous tone luminance watermark, like that shown in FIG. 1). FIGS. 15B-15ZZ show similar results, when the network is trained to apply other styles to such a watermarked pattern. Some of the styles are based on photographs of physical items, e.g., grains of rice, water droplets on grass, leaves of grass, pasta noodles, candy corn, woven basket reeds, denim fabric, etc. (natural imagery). Others of the styles are based on computer graphics (synthetic imagery), such as the geometric bitonal imagery used as inputs in FIGS. 8-10.

FIG. 15WW is generated by applying the Johnson method with the “Candy” style (e.g., as shown in FIG. 14) to a continuous tone watermark, converting to greyscale, increasing the contrast, and applying a blue spot color. FIG. 15XX is similarly generated, except that it is rendered in a duotone mode. That is, highlights and mid-tones of the image are identified and rendered in a first user-selected (light blue) color. The remaining, darker, tones are rendered in a second user-selected (darker blue) color.

FIG. 15YY is similar to FIG. 15WW, but generated with the “Feather” style. FIG. 15ZZ is similar, but generated with the “Mosaic” style.

It will be recognized that color features of the style images (Feather, Mosaic, etc.) are omitted from the images shown in FIGS. 15WW-15ZZ. The output from the neural network is often converted to greyscale and then false-colored—commonly in monochrome—for use in a particular packaging context. (It is often the case that structure is more important than color, when adapting features from a style image for use in consumer packaging. Color is commonly changed to suit the package colors. Compare, e.g., the colored gumballs of FIG. 15PP, and the falsely-colored gumball background of FIG. 28A.)
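The greyscale-then-false-color treatment can be approximated as sketched below, mapping the stylized block onto two user-selected package colors; the colors shown are arbitrary examples, and a production duotone separation would typically be done in prepress software.

```python
import numpy as np
import cv2

def false_color(stylized_bgr, light=(230, 216, 173), dark=(139, 61, 20)):
    """Convert a stylized block to greyscale, then map it between two
    user-selected BGR colors (dark shadows, light highlights)."""
    grey = cv2.cvtColor(stylized_bgr, cv2.COLOR_BGR2GRAY).astype(np.float32) / 255.0
    light, dark = np.array(light, np.float32), np.array(dark, np.float32)
    out = dark[None, None, :] + grey[:, :, None] * (light - dark)[None, None, :]
    return out.astype(np.uint8)
```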

Applicant has discovered that style images can yield better or worse results depending on the atomic scale of elements depicted in the image. The watermark pattern, to which the style is applied, is commonly comprised of units (watermark elements, or “waxels”) that have a particular scale, e.g., regions of 2×2 or 3×3 pixels. The stylized results most faithfully follow the style image, and the watermark is most reliably readable from the stylized result, if the style image has a structure with elements of a comparable scale. Comparable here means a majority of elements in the style image have a scale in the range of 0.5 to 25× that of elements in the watermark. Thus, for waxels that are 3×3 pixels, individual rice grains in a style image desirably have a minimum dimension (width) of 1.5 pixels, and a maximum dimension (length) of 75 pixels. Likewise for blades of grass, weaves of fabric or basketry, droplets of water, pieces of candy corn, etc. (Best results are with items that are not just comparable, but close, in scale, e.g., in a range of 0.5×-3× the waxel dimension.)

By so doing, the style transfer network can morph the style pattern to assume spatial attributes of the watermark pattern, without either grossly departing from the style of the style image, nor distorting the scale of the watermark waxels so that their readability is impaired.

This is illustrated by FIGS. 16A-16C. FIG. 16A shows an excerpt of a watermark pattern. FIG. 16B shows a pattern resulting after a network trained with the Johnson method, using a raindrop style pattern, is applied to the FIG. 16A excerpt of a watermark pattern. FIG. 16C overlays FIGS. 16A and 16B. Note how the network has adapted the orientation, translation, scale and contours of the raindrops to conform to the orientation, translation, scale and contours of features within the watermark pattern. The correlation is somewhat astounding.

In the prior art, continuous tone watermark signals are commonly spread across host artwork at a very low amplitude, changing pixel values in an image a small amount—perhaps up to 10 or 20 digital numbers in an 8-bit (0-255) image. In some applications of style transfer-based watermarking, in contrast, the signal needn't be spread across the image; it may be lumpy—limited to local regions, e.g., just 10-30% of the image (as in the water droplets style of FIG. 16B). But in those local regions its amplitude can be much greater, e.g., spanning 100, 200 or more digital numbers of dynamic range; in some cases the full 256 levels. The net signal energy may be comparable, but its concentration in limited areas can make the signaling more robust to distortion and signal loss.

Review

To review, one aspect of the present technology concerns a method of generating artwork for labeling of a food package. Such method includes receiving imagery that encodes a plural-bit payload therein; neural network processing the imagery to reduce a difference measure between the processed imagery and a style image; and incorporating some or all of the neural network-processed imagery in artwork for labeling of a food package, where the plural-bit payload persists in the neural network-processed imagery. Such arrangement enables a compliant decoder module in a retail point of sale station to recover the plural-bit payload from an image of the food package.

In a related method, a first image (e.g., a host image, depicting a subject) is digitally-watermarked to encode a plural-bit payload therein, yielding a second image. This second image is neural network-processed to reduce a difference measure between the processed second image and a style image.

In such arrangements, the difference measure can be based on a difference between a Gram matrix for the processed second image and a Gram matrix for the style image.

(It will be recognized that reducing a difference measure between the processed imagery and the style image can be replaced by increasing a similarity measure between the processed imagery and the style image.)

A further aspect of the technology involves a method that starts by receiving a pattern that encodes a plural-symbol payload. This pattern is then stylized, using a neural network (typically previously-trained), to produce a stylized output image. The style is based on a style image having various features, wherein the network adapts features from the style image to express details of the received pattern, to thereby produce an image in which features from the style image contribute to encoding of said plural-symbol payload.

In another aspect, a method again starts with an input pattern that encodes a plural-symbol payload. This pattern is then stylized, using a neural network, to produce a stylized output image. The network is trained based on an image depicting a multitude of items of a first type (rain drops, rice grains, candy balls, leaves, blades of grass, stones, beans, pasta noodles, bricks, fabric threads, etc.). The network adapts scale, location, and rotation of the items as depicted in the stylized output image to echo, or mimic, features of the originally-input pattern, to thereby produce an image depicting plural of the items in a configuration that encodes said plural-symbol payload.

Another aspect of the technology concerns a method in which a user interface enables a user to define a multi-symbol payload, and to select from among plural different styles. In response to user selection of a first style, a sparse watermark pattern is generated. This pattern includes a web of lines defining plural polygons, with the arrangement of the polygons encoding the payload. In response to user selection of a second style, a continuous tone watermark pattern is generated, and elements of the second style are adapted to mimic elements of the watermark pattern and encode the payload.

Yet another method involves receiving a sparse pattern of points that encodes plural-symbol data, such as a Global Trade Item Number identifier. A transformed image is generated from this sparse pattern of points by a process that includes transforming the sparse pattern by applying a directional blur or wind filter in a first direction. Such filtering may also be applied to the sparse pattern in a second direction, and the results combined with the earlier results.

Another aspect of the technology involves a food package on which artwork is printed, where the artwork encodes a plural bit payload (e.g., including a Global Trade Item Number identifier). The artwork has first and second regions, which may be adjoining. The first region includes a background pattern that has been stylized from a watermarked input image using a neural network.

In such arrangement, the second region may include a watermarked image that has not been stylized using a neural network, yet the watermark included in the second region is geometrically-aligned with a watermark signal included in the first region. By such variant arrangement, a detector can jointly use signals from both regions in decoding the payload.

Still another aspect of the technology involves receiving a watermarked image, and then stylizing the watermarked image, to produce a styled output. This styled output is then false-colored, and incorporated into artwork.

A further aspect involves obtaining data defining a square watermark signal block, comprising N rows and N columns of elements. This signal block encodes an identifier, such as a Global Trade Item Number identifier. Plural of these rows and columns are replicated along two adjoining edges of the square signal block, producing an enlarged signal block having M rows and M columns of elements. The enlarged block is then sent for processing to a style-transfer neural network. The enlarged signal block is received back after processing, and an N row by N column portion is excerpted from the received block. This excerpt is then used, in tiled fashion, within artwork to convey the identifier to camera-equipped devices (e.g., enabling a POS terminal to add an item to a shopper's checkout tally).
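A minimal sketch of this pad-stylize-excerpt sequence follows; the stylize argument stands for any block-to-block style transfer function (such as those sketched earlier), and the pad amount is illustrative.

```python
import numpy as np

def stylize_tileable(block, stylize, pad=32):
    """Replicate rows/columns along two adjoining edges before stylizing, then
    excerpt the original N x N area, so tiled copies remain continuous at edges.

    block: N x N watermark signal block; stylize: block -> block style function;
    pad: number of replicated rows/columns (illustrative).
    """
    n = block.shape[0]
    enlarged = np.pad(block, ((0, pad), (0, pad)), mode="wrap")   # M = N + pad
    styled = stylize(enlarged)
    return styled[:n, :n]                                         # N x N excerpt
```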

Yet another aspect is a method including: receiving a pattern of dots encoding a plural-symbol payload, and then padding the pattern of dots, by replicating portions of the pattern along at least two adjoining sides of the square, to yield a larger pattern of dots. A pattern of polygons is then generated from this larger pattern of dots. The polygon pattern can then be used as a fill or background pattern in commercial art, such as product packaging.

Still another method includes obtaining a watermark signal that encodes an identifier and is represented by watermark signal elements (e.g., waxels) of a first size. This watermark signal is stylized in accordance with a style image, which depicts multiple items of a first type (e.g., grains of rice, droplets of water, pieces of candy), where the items depicted in the style image have a size that is between 0.5 and 25 times said first size.

Yet another aspect of the technology concerns a method for producing a code for message signaling through an image. The method employs a neural network having an input, one or more outputs, and plural intermediate layers. Each layer comprises plural filters, where each filter is characterized by plural parameters that define a response of the filter to a given input. The method is characterized by iteratively adjusting an input test image, based on results produced by the filters of said neural network, until the test image adopts both (1) style features from one image, and (2) signal-encoding features from a second image.

Yet a further aspect of the technology is a method for producing a code for message signaling through an image, using a deep neural network—which may have been trained to perform object recognition. The network has an input, one or more outputs, and plural intermediate layers, where each layer includes multiple filters, each characterized by plural parameters that define a response of the filter to a given input. The method includes receiving an image comprising a machine readable code that is readable by a digital watermark decoder, where the machine readable code includes a signal modulated to convey a plural-symbol payload. This machine readable code image is presented to the network, causing the filters in the network layers to produce machine readable code image filter responses. The method further includes receiving an artwork image; presenting the artwork image to the network; and determining correlations between filter responses in different of the layers, to thereby determine style information for the artwork image. The method further includes receiving an initial test image, and presenting same to the neural network, causing filters in plural of the layers to produce test image filter responses. Style information for the test image is then determined by establishing correlations between the test image filter responses of plural of the filter layers. A first loss function is determined based on differences between the test image filter responses and the machine readable code image filter responses, for filters in one or more of said layers. A second loss function is determined based on a difference between the style information for the first and test images. The test image is then adjusted based on the determined first and second loss information. This procedure is repeated plural times, to incrementally adjust the test image to successively adopt more and more of the style of the artwork image. By such arrangement, the test image is transformed into an image in which the modulated signal is obfuscated by the style of the artwork image, yet is still readable by the digital watermark decoder.

Still another aspect of the technology concerns a method for producing an obfuscated code for message signaling through an image. This method includes: receiving a plural symbol payload; applying an error correction coding algorithm to the plural symbol payload to generate an enlarged bit sequence; and processing the enlarged bit sequence with pseudo-random data to yield a pseudo-random bit sequence. This bit sequence is mapped to locations in an image block. A counterpart image block depicts a spatial domain counterpart to plural signal peaks in a spatial frequency domain, and the two image blocks are combined to yield a code pattern from which the plural symbol payload can be recovered by a decoder. This code is applied to a multi-layer convolutional neural network. Filter responses for one or more layers in the network are stored as first reference data. A pattern (e.g., a bitonal geometric pattern) is then applied to the convolutional neural network. Data indicating correlations between filter responses between two or more of the layers, are stored as second reference data. A test image is then modified based on the first and second reference data, to yield an obfuscated code pattern, different from the code pattern and different from said geometrical pattern, from which the plural symbols can be recovered by the decoder.

As noted, one use of these technologies is in producing artwork for use in packaging retail items, such as grocery items. For example, a part of the artwork can be based on the stylized output image. This enables a camera-equipped point of sale apparatus to identify the package, by decoding the plural-symbol payload (which may comprise a GTIN).

Another aspect of the technology is a packaged food product having label artwork that is derived from a pattern substantially similar (in a copyright sense) to one of the patterns illustrated in FIGS. 1, 8-11, 13, 14A, 15A-15ZZ, 19A, 19B, 21D, 21E, 29A-29C, 30A, 30B, 31A-41, and 44. The pattern may be monochrome or colored (including false-colored).

Review of the Johnson Method

The earlier-discussed Gatys approach has three inputs: a style image, a content image, and a test image (which may be a noise frame). A network iteratively modifies the test image until it reaches a suitable compromise between the style image and the content image—by optimizing a loss function based on differences from those first two inputs. The Johnson approach, in contrast, uses a single-pass feed-forward network, trained to establish parameters configuring it to apply a single style image. After training it has just one input—the content image.

Johnson's feed-forward image transformation network is a convolutional neural network configured by parameter weights W. It transforms input images x into output images ŷ by the mapping:


ŷ=fW(x)

The weights W are established using information from a second, VGG-16 network, termed a loss network. This network is one that has been pretrained for image classification, e.g., on the ImageNet dataset. Such training configures it to produce features that represent semantic and perceptual information about applied imagery. These features are used to define a feature reconstruction loss and a style reconstruction loss that measure differences in content and style between images.

For each input image x there is a content target yc and a style target ys. For style transfer, the content target yc is actually the input image x, and the output image ŷ should combine the content of x=yc with the style of ys. Two perceptual scalar loss functions are used in this process, to measure high-level semantic and perceptual differences between images. Instead of encouraging the pixels of the output image ŷ (i.e., fW(x)) to exactly match the pixels of the target image y, the Johnson network encourages them to have similar feature representations as computed by the loss network.

The first loss function indicates similarity to the originally-input image. This is termed the feature reconstruction loss. Consider ϕj(x) as the activation of the jth layer of the loss network ϕ when processing an image x. If j is a convolution layer having Cj filters, then ϕj(x) will be a feature map of shape Cj×Hj×Wj. The feature reconstruction loss is the (squared, normalized) Euclidean distance between feature representations:

lfeatϕ,j(ŷ,y) = (1/CjHjWj) ∥ϕj(ŷ)−ϕj(y)∥22

Minimizing this feature reconstruction loss for early layers in the VGG-16 image recognition network tends to produce images that are visually indistinguishable from y. That's not what is desired here. Instead, minimization of this feature reconstruction loss for one or more later layers in the network is employed. This produces images that preserve general shape and spatial structure, but permit variation in color, texture, and shape details. Thus, applying this feature reconstruction loss in training the image transformation network encourages the output image ŷ to be perceptually similar (semantically similar) to the target image y, but does not drive the images to correspond exactly.

The second loss function indicates style similarity to the style image. This is termed the style reconstruction loss. As in Gatys, a Gram matrix is used.

Again, let ϕj(x) be the activations at the jth layer of the loss network ϕ for the input x, which is a feature map of shape Cj×Hj×Wj. The Gram matrix Gjϕ(x) is the Cj×Cj matrix whose elements are given by:

Gjϕ(x)c,c′ = (1/CjHjWj) Σh=1...Hj Σw=1...Wj ϕj(x)h,w,c ϕj(x)h,w,c′

The style reconstruction loss is the squared Frobenius norm of the difference between the Gram matrices of the output and target images:


lstyleϕ,j(ŷ,y)=∥Gjϕ(ŷ)−Gjϕ(y)∥F2

Minimizing this style reconstruction loss minimizes stylistic differences between the output and target images, but does not necessarily preserve their spatial structure. Johnson found that generating the style reconstruction loss using information from later layers in the VGG-16 network helps preserve large-scale structure from the target image. Desirably, information from a set of layers J, rather than a single layer j, is used. In such case the style reconstruction loss is the sum of the losses for all such layers.

In a particular embodiment, Johnson computes the feature reconstruction loss using feature data output by layer relu2_2 of the VGG-16 loss network, and computes the style reconstruction loss from feature data output by layers relu1_2, relu2_2, relu3_3 and relu4_3.

To train the weights W of the image transformation network, Johnson applies the 80,000 images of the Microsoft COCO dataset to the network. The images output by the image transformation network are examined by the loss network to compute feature reconstruction loss and style reconstruction loss data. The weights W of the image transformation network are adjusted, in a stochastic gradient descent method, to iteratively reduce a weighted combination of the two loss functions l1 and l2, using corresponding regularization parameters λi:

W* = arg minW Ex,{yi}[Σi λi li(fW(x),yi)]

(Johnson applied normalization, within his image transformation network, in batch fashion, i.e., batch normalization. Ulyanov later demonstrated that superior results may be achieved by applying normalization with a batch size of 1, i.e., normalizing with every training image individually. (See “Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis,” Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, 2017, pp. 6924-6932.) For present purposes, the Ulyanov improvement is regarded as falling within the Johnson method.)

In the above-detailed manner, the image transformation network is trained to cause the output images to adopt the style of the style image, while still maintaining the general image content and overall spatial structure of the input images.

(Naturally, this is an abbreviated summary of the Johnson method. The reader is directed to the Johnson paper, and the many public implementations of his method on Github, for additional information.)

Further Arrangements

Applicant has extended the above-detailed methods to combine elements from three images. This is illustrated by FIGS. 17A-E.

FIG. 17A is first host artwork, in a vector format. FIG. 17B is a watermark pattern that encodes a particular payload. These two images are combined (e.g., by weighted summing, multiplication, or otherwise) to yield the watermarked artwork of FIG. 17C. FIG. 17D shows a style image with which a network has been trained by the Johnson method. FIG. 17E shows the composite artwork of FIG. 17C, after stylization by the trained Johnson network. (As just-discussed in connection with FIGS. 16A-16C, the stylized elements (droplets) in FIG. 17E are adapted in size, position, and contours, to conform to features within the FIG. 17B watermark pattern.)

One application of the just-described process is to generate logos of consumer brands, which are both watermarked (e.g., to convey a URL to a corporate web page) and stylized. Such stylized logos can be printed, e.g., on packaging artwork.

The host artwork needn't be bitonal (e.g., black and white), and it needn't be in vector format. Instead, it can be continuously-toned, such as a greyscale or RGB image. For example, FIG. 18A shows a watermarked version of the Lenna image, after stylization using the network that produced FIG. 15PP (i.e., applying a candy ball style). FIG. 18B shows a watermarked version of Lenna stylized with the same network that produced FIG. 15VV (i.e., applying a denim fabric style). FIG. 18C shows a watermarked version of Lenna stylized with the same network that produced FIG. 15F (applying a grass style).

Since stylization typically imparts a dominating texture on the artwork, the watermark signal can be embedded at an amplitude far greater than usual. As noted, continuous tone watermark signals are usually applied at amplitudes small enough to prevent the watermark patterning from being perceptible to casual viewers. But since the image texture is so-altered by stylization, patterning due to the watermark signal is masked—even if grossly visible prior to stylization (as in FIG. 17C).

As noted, the detailed methods can be applied to any watermark pattern or watermarked imagery. FIG. 19A shows a sparse watermark rendered as a Delaunay triangle pattern. FIG. 19B shows this same watermark after stylization with a leaf pattern. Again, each of the leaf elements in FIG. 19B has been scaled, rotated, and positioned to correspond to one of the component triangles within the Delaunay pattern. FIG. 19C shows enlargements of the lower right corners of the two images (indicated by the dashed squares in FIGS. 19A and 19B) overlaid together, by which the correspondence can be examined.

A single pattern can be parlayed into a dozen or more different patterns using invertible transforms. Invertible transforms are those that can be reversed—enabling a transformed set of information to be restored back to its original state.

In geometry, invertible transforms include affine transforms such as rotation, scaling (both scaling uniformly in x- and y-, and non-uniformly), translation, reflection, and shearing. Invertible geometrical transforms also include non-affine transforms, such as perspective, cylindrical, spherical, hyperbolic and conformal distortion. There are other invertible transforms as well, such as color transforms, color inversion, etc.

A general process employing this aspect of the technology is depicted by FIG. 20. A watermark pattern is generated, transformed, stylized, and inverse-transformed.

In a particular embodiment of the FIG. 20 process, a continuous tone watermark pattern is generated (FIG. 21A), containing a desired payload. A sparse mark (FIG. 21B) is derived from the FIG. 21A pattern, e.g., by thresholding, or by one of the techniques detailed in the cited patent documents. (The dots are enlarged in size for sake of visibility.) The FIG. 21B pattern is then asymmetrically (differentially) scaled—expanding the horizontal expanse by a factor of two—producing the FIG. 21C pattern. That pattern is then converted into a corresponding Voronoi pattern, shown in FIG. 21D. That pattern is then inverse-transformed—compressing the horizontal expanse by a factor of two—producing the FIG. 21E pattern.

The original watermark pattern is thus differentially-scaled, and then inverse-differentially-scaled. This restores the watermark pattern to its original state. But the Voronoi cells are added in the middle of the process. They do not undergo two complementary operations. Rather, that Voronoi pattern is just horizontally compressed by a factor of two (i.e., scaled by a horizontal factor of 0.5). Its appearance thus changes.
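
The following sketch illustrates this transform / pattern-generation / inverse-transform sequence using scipy's Voronoi routine; the dot coordinates are placeholders, and the 2× stretch and 0.5× compression follow the FIG. 21 example:

```python
# Sketch only: compute Voronoi cells on a differentially-scaled copy of the dot
# pattern, then compress the cell geometry back. The dots return to their original
# positions, but the cell shapes differ from an unstretched Voronoi pattern.
import numpy as np
from scipy.spatial import Voronoi

rng = np.random.default_rng(seed=7)
dots = rng.uniform(0, 128, size=(300, 2))        # stand-in sparse-mark dot locations

stretched = dots * np.array([2.0, 1.0])          # expand horizontal expanse by 2x
vor = Voronoi(stretched)                         # cells computed in stretched space

compressed_vertices = vor.vertices * np.array([0.5, 1.0])   # inverse transform
compressed_dots = stretched * np.array([0.5, 1.0])          # dots back where they began
```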

FIG. 23A shows the Voronoi pattern corresponding to the original dot pattern of FIG. 22B. FIG. 23B is the Voronoi pattern produced by the just-detailed transform/inverted-transform method (i.e., it is an enlargement of FIG. 22E). Each is shown overlaid with the original patterns of dots. As can be seen, each dot is still surrounded by a polygon. But the shapes of the polygons are different, as is particularly evident from the enlarged excerpts of FIGS. 24A and 24B (corresponding to the lower right hand corners of FIGS. 23A and 23B, respectively).

FIGS. 25A and 25B show that visible patterning encoded with machine readable data, produced according to the methods herein, can be used as a background pattern, or texture, on a food container—such as a cereal or cake mix box. Such patterning can appear at any location on the container, and on any face(s) (not just the illustrated front face). FIG. 26 shows such a pattern on a can, such as a drink can, a soup can, or a can of vegetables. FIG. 27 shows such a pattern on a plastic tub, such as a yogurt or sour cream tub. While a Voronoi pattern is illustrated, it should be understood that patterns produced by any of the processes referenced herein can be so-utilized.

FIGS. 28A and 28B are enlarged excerpts from a yogurt container, depicting signal-conveying artwork produced by the above-detailed process (employing a trained network according to the Johnson method) employed as background patterns. The pattern of FIG. 28A was produced using a Johnson network to apply a bubble gum balls pattern to a watermark signal block, e.g., as shown in FIG. 15PP. The pattern of FIG. 28B was produced by the method of Johnson to apply a fanciful pattern to a watermark signal block, e.g., as shown in FIG. 15D. Both patterns have been altered in coloration—an additional processing step that does not impair decodability of the luminance-based watermark signal blocks on which both are based.

The artwork on the yogurt containers depicted in FIGS. 28A/28B also includes a depiction of a vanilla tree flower, branch and leaf. This artwork is defined by layers of patterned spot color inks (e.g., Pantone 121, 356, 476) and black. For example, the flower petals are predominantly screened Pantone 121. That layer is watermarked with a luminance watermark. The payload conveyed by the watermark in the flower petals (i.e., the GTIN for the yogurt product) is the same payload as is conveyed by the watermark in the violet background patterning (which is rendered with screened Pantone 2071). Moreover, although printed in different ink layers, the watermark signal block portions encoded within the flower petals are geometrically registered with the watermark signal block portions encoded within the background patterning. (A few such blocks are shown by dashed lines in FIG. 28B; many span both the background pattern and the foreground vanilla art.) Both the foreground pattern and background pattern watermarks contribute signal by which a detector extracts the watermark payload; the detector doesn't distinguish between the signals contributed by the different layers since they seamlessly represent common watermark signal blocks.

When processing a watermark or watermarked image with a style transfer network, edge artifacts can be a problem. Watermark patterns, themselves, are continuous at their edges. If tiled, watermark patterns continue seamlessly across the edges. Style patterns, in contrast, are typically not continuous at their edges. Further, the convolution operations performed by style transfer networks can “run out of pixels” when convolving at image edges. This can lead to disruptions in the edge-continuity of the watermark.

To address this problem, applicant pads the input watermark with rows and columns of additional watermark signal, as if from adjoining tiles, before applying it to the style transfer network. A 128×128 element watermark signal block may be padded, for example, to 150×150 elements. This enlarged image is processed by the style transfer network. The network outputs a processed 150×150 image. The center 128×128 elements are then excerpted. No edge artifacts then appear.

In some instances, applicant generates a 256×256 element input image by tiling four watermark signal blocks. The network outputs a 256×256 style-processed image. Again, an interior (e.g., central) 128×128 block is excerpted from this result. (All of the signal blocks shown in FIGS. 15A-15ZZ are seamlessly tileable.)
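
A minimal sketch of this padding-and-cropping workaround follows, assuming the watermark block tiles seamlessly; the stylize argument is an assumed stand-in for whatever trained style transfer network is used:

```python
# Sketch only: pad the watermark block with wrapped (tiled) copies of itself,
# stylize the enlarged image, then excerpt the center block to avoid edge artifacts.
import numpy as np

def stylize_without_edge_artifacts(wm_block, stylize, pad=11):
    padded = np.pad(wm_block, pad, mode='wrap')   # e.g., 128x128 -> 150x150 for pad=11
    out = stylize(padded)
    return out[pad:-pad, pad:-pad]                # excerpt the center 128x128 elements
```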

Aspects of such method are illustrated in the flow charts of FIGS. 14B and 14C. (The dashed rectangles show acts that are usually performed later—not necessarily by the software whose user interface is depicted in FIG. 14A.)

FIG. 29A is an excerpt of product packaging showing a sparse dot pattern (a stipple pattern) as a graphical background (enlarged about 3×). FIG. 29B shows a Voronoi counterpart to the FIG. 29A pattern, again used as a graphical background. FIG. 29C shows a Delaunay counterpart to the FIG. 29A pattern.

Comparing the right sides of FIGS. 29A-29C, it will be seen that the introduction of more white features within the dark blue background serves to effectively lighten the blue background—at least as seen from a distance. In accordance with a further aspect of the present technology, a background spot or process color may be rendered darker than normal, so that the addition of a light-colored pattern of dots or lines serves to lighten the area to achieve a desired appearance. For example, an ink can be applied at a higher screen setting, or with a smaller proportion of a thinner, to yield a darker tone. Or a darker ink color can be selected.

The inverse occurs on the left side: the addition of dark features darkens the light background color. This can be mitigated by starting with a lighter background color, so the addition of the darker features yields a composite appearance of a desired tone. The background ink layer may be screened-back, or the ink may be diluted. Or a lighter ink may be chosen. (Of course, if white is the background, as in FIGS. 29A-29C, then options to further lighten the background are limited.)

Triangulation patterns, based on sparse dot patterns, can be used as the bases for many aesthetically pleasing patterns suitable for product packaging, etc. One class of such patterns takes a Delaunay pattern and fills the triangles with different greyscale values, or with differently-screened tones of a particular color. As long as the lines bounding the triangles are lighter, or darker, than most of the triangle interiors, the pattern will still decode reliably. (The convergences of lighter or darker boundary lines at the triangle vertices define points of localized luminance extrema by which the encoded signals are represented.)

Another pleasing triangle-based arrangement is shown in FIG. 30A. This pattern is two patterns overlaid, each conveying a payload signal, but at different spatial resolutions—permitting decoding from two different (overlapping) reading distance zones.

In the FIG. 30A arrangement, the triangles are not bordered by lighter or darker lines. The vertices thus don't serve as luminance extrema that represent the encoded signals. Instead of carrying signal, the vertices and edges correspond to a sparse mark obtained from local extrema in the gradients within the watermark signal block. The result is that each triangle encircles a local minimum or maximum in the signal tile. The lightness or darkness within each triangle corresponds to the lightness or darkness of a corresponding area in the watermark signal block. So the greyscale values of the triangles actually carry the signal. (In another arrangement, the triangles needn't be defined by gradients in the watermark signal. A random scattering of points can serve as vertices for a field of triangles and will work just as well, defining triangles whose interiors are made darker or lighter in correspondence with localized luminance within the desired watermark signal block.)
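
The sketch below illustrates this fill-carries-the-signal idea using matplotlib's triangulation utilities. The vertex count, block size, and stand-in watermark block are illustrative only:

```python
# Sketch only: triangulate a random scattering of vertices and shade each triangle
# by the watermark value at its centroid, so the triangle fills carry the signal.
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.tri as mtri

rng = np.random.default_rng(seed=3)
wm = rng.uniform(0, 255, size=(128, 128))          # stand-in watermark signal block
pts = rng.uniform(0, 127, size=(2000, 2))          # random vertices, per the text

triang = mtri.Triangulation(pts[:, 0], pts[:, 1])  # Delaunay triangulation
centroids = pts[triang.triangles].mean(axis=1)     # one centroid per triangle
shades = wm[centroids[:, 1].astype(int), centroids[:, 0].astype(int)]

plt.tripcolor(triang, facecolors=shades, cmap='gray')
plt.gca().set_aspect('equal')
plt.axis('off')
plt.savefig('triangle_fill_pattern.png', dpi=300)
```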

Due to the typically-small physical size of the component watermark elements (waxels), the triangles must be commensurately small to effectively carry the signal—on the order of eight square waxels, or about 2000 triangles per 128×128 waxel block (the more the better). But such triangles are often smaller than desired for aesthetic purposes, e.g., when rendered by printing at 75 waxels per inch. So the same signal is added, with transparency, at a larger scale (e.g., 6×, or 12.5 waxels per inch). These larger triangles are more readily visible, but do not hinder the decoding of signal from the smaller triangles, because the larger triangles are effectively filtered-out by down-sampling and/or oct-axis filtering in the decoder.

In many embodiments, both the large and small triangles encode the same payload, although this needn't be the case. In other arrangements, the larger scale overlay needn't be signal-bearing. It can be a strictly ornamental pattern, within which the smaller scale triangles serve as an intriguing, and signal-conveying, texture detail. In still other arrangements, polygons other than triangles can be employed. In some arrangements, the smaller triangles would be sub-triangulations of the larger triangles, to further enhance aesthetics. More simply, the vertices of each larger triangle may be snapped to the nearest vertices of the small triangles.

FIG. 30B shows a watermark signal block quantized into triangular areas, with the luminance of each area corresponding to the luminance of a corresponding area within the watermark.

FIG. 31A shows a pattern derived using Photoshop software to manipulate a sparse watermark (generated using any of the methods taught in the earlier-cited patent documents), and FIG. 31B shows an enlarged excerpt. The sparse, dark dots are here all smeared in straight parallel lines. The dots thus have corresponding tails which diminish in darkness with distance from the original location. The dots themselves appear inverted, as white regions.

This effect is generated by the following acts: A sparse dot watermark (black dots on white background) is created, with a desired payload, using the Digimarc Barcode plug-in software for Photoshop. Dot density is set to 20, and dot size is set to 4. The Photoshop Wind filter is applied, with the option From the Left selected. This Wind filter is applied three successive times. The resulting image is then inverted (Image/Adjustments/Invert), and the mode is changed to Lighten. A new layer is then created and the above acts are repeated, but this time with the Wind option set to From the Right.

(As is familiar to artisans, a wind filter detects edges in an image, and extends new features from the detected edges. The features are commonly tails that diminish in width and intensity with distance from the edges.)

The pattern of FIGS. 32A and 32B is generated in Photoshop software by creating a sparse dot watermark as before, with a dot density of 20 and a dot size of 4. The Motion Blur filter is then applied using settings of 0 Degrees and Distance=32 pixels.

The pattern of FIGS. 33A and 33B is generated by starting with a negative sparse dot watermark (i.e., white dots on a black background). Again, dot density is set to 20 and dot size is set to 4. A Motion Blur filter is applied with settings of 90 Degrees and Distance=15 pixels. A new layer is then created with the same dot pattern, and a Motion Blur filter is applied with settings of 0 degrees and Distance=15 pixels. The mode of the top layer is then changed to Lighten.

FIGS. 34A and 34B show a pattern produced from a sparse field of white dots on a black background, which are then smeared in straight parallel lines. Here the dots have corresponding tails which diminish in brightness with distance from the original location.

This pattern can be produced by generating a negative sparse dot watermark as just-detailed, with dot density of 20 and dot size of 4. A Motion Blur filter is applied, set to 0 Degrees, and Distance=15 pixels. A new layer is created with the same dot pattern. A Wind filter is then applied, with Direction set to From the Left. This filter is applied a second time on this layer. The mode of the top layer is then changed to Lighten.

The pattern of FIGS. 35A and 35B is produced by generating a negative sparse dot watermark as just-detailed, and applying a Motion Blur filter with settings of 90 degrees and Distance=15 pixels.

The pattern of FIGS. 36A and 36B is produced from a field of black dots on a white background, with dot density of 20 and dot size of 4. A Motion Blur filter is applied with settings of 90 Degrees and Distance=15 pixels. A new layer is created with the same dot pattern, and the Motion Blur filter is applied to this new layer with settings of 0 Degrees and Distance=15 pixels. The Mode of the top layer is then changed to Multiply.

The pattern of FIGS. 37A and 37B starts with a pattern of white dots on black, as before. The Wind filter is applied, set to From the Right. This filter is re-applied two more times. A new layer is then created with the same dot pattern, and the Wind filter is similarly applied three times, but this time set to From the Left. The mode of the top layer is then changed to Lighten.

FIGS. 38A and 38B look similar to FIGS. 39A and 39B, except each clump of dots numbers between 0 and 9 dots, depending on the local amplitude of the watermark. This pattern is generated by creating a sparse mark of black dots on a white background, as described earlier. A Motion Blur filter is applied with settings of 90 Degrees, and Distance=15 pixels. The mode is then changed to Bitmap, at 300 Pixels/Inch, and with a Halftone Screen. The halftone screen parameters are set to a Frequency of 50 Lines/Inch, an Angle of 90 Degrees, and a Shape of Line.

FIGS. 39A and 39B appear to show a continuous tone luminance watermark that is sampled at points along spaced-apart parallel lines. Around each sample point a clump of black dots (numbering from 0 to 4 dots) is formed, in accordance with the amplitude of the watermark at that point. A pattern of intermittent line-like structures, of varying thickness, is thereby formed.

Such pattern is produced by starting with a greyscale document set to 4% black. The Digimarc plug-in software is then used to enhance this document with a continuous tone luminance mark at Strength=10. The Photoshop software is then used to change the mode to Bitmap, with settings of 300 Pixels/Inch, Halftone Screen, a Frequency of 50 Lines/Inch, an Angle of 90 degrees, and Shape set to Line.

FIGS. 40A and 40B have the appearance of a continuous tone luminance watermark that is sampled at points in an orthogonal two-dimensional grid. A clump of black dots (e.g., numbering from 0 to 7 dots) is then formed around each sample point, in accordance with the amplitude of the watermark at that point.

Such pattern is produced by starting with a greyscale document set to 4% black. The Digimarc plug-in software is then used to enhance this document with a continuous tone watermark at Strength=10. The Photoshop software is then used to change the mode to Bitmap, with settings of 300 Pixels/Inch, Halftone Screen, a Frequency of 50 Lines/Inch, and an Angle of 45 degrees. The Shape is set to Line.

Of course, the parameters given above can be varied to yield correspondingly different results, as best suits a particular application. Similarly, the order of the detailed steps can be changed, and steps can be added or omitted. And naturally, the patterns can be colored, e.g., as shown in FIG. 41.

It will be recognized that a pattern like that shown in FIG. 41 has a visual appeal that a spattering of black dots, as in a plain sparse mark, does not. Such pattern, and others generated by the detailed techniques, thus give graphic designers greater options for encoding product identifiers on all surfaces of product packaging. And this new-found flexibility comes without the worry about press tolerances that was formerly a concern, since signal-carrying structures no longer need to be precisely rendered just below the threshold of human visibility.

Line Continuity Modulation

It should be recognized that there are other watermarking techniques besides the continuous tone and sparse arrangements detailed herein, and the detailed methods can be applied to artwork based on such other watermarking techniques. One example is line continuity modulation.

FIG. 42A shows an excerpt from artwork for a conference poster. FIG. 42B is an enlargement from the lower right corner of FIG. 42A. As can be seen, the shadows of the letters include diagonal dark blue lines that are broken into segments. These segments, and their breaks, serve to modulate the local luminance and chrominance of the artwork—modulations that are used to encode the watermark payload. (It will also be seen that the fills of the letters are Voronoi patterns based on a sparse watermark.)

The modulation of lines is implemented as follows: two images are created with slanted lines. All the lines are dashed, but lines in the first image have longer dashes than lines in the second image, so the average greyscale value of lines in the second image is lighter. A watermark signal block is thresholded to create a binary mask, so that watermark block signal values between 128 and 255 (in an 8-bit image representation) are white, while watermark signal values of 0-127 are black. Complete line segments from the second image are then chosen within the white masked area, and the black mask is similarly applied to choose segments from the first image. The lighter greyscale dashed lines are thus picked in regions where the watermark signal block is lighter, and the darker greyscale dashed lines are thus picked in regions where the watermark signal block is darker. The combination of the dashed lines from the two masked layers produces the embedded image. A horizontal line density of at least ⅔ (67%) of a line per waxel is found to be best to assure robust detection.
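
A per-pixel sketch of the masking step follows. A full implementation selects complete dashed segments as described above; here the two line images (assumed precomputed, with the lighter-toned dashes in light_lines) are simply composited through the thresholded mask:

```python
# Sketch only: threshold the watermark signal block into a binary mask and pick
# pixels from the lighter or darker dashed-line image accordingly. A complete
# implementation would select whole dashed segments rather than individual pixels.
import numpy as np

def composite_line_modulation(wm_block, dark_lines, light_lines):
    mask = wm_block >= 128                       # white region: signal values 128-255
    return np.where(mask, light_lines, dark_lines)
```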

In a variant implementation, line continuity modulation can be effected by poking holes in an array of continuous lines at positions defined by dots in a negative sparse mark.

Styluses, Etc.

Another aspect of the present technology concerns code signaling via manually-created art and other content.

Styluses are in widespread use for manually-creating content, both by graphics professionals (e.g., producing art) and by office workers (e.g., taking handwritten notes).

Styluses are commonly used with tablets. Tablets may be categorized in two varieties. One variety is tablets in which a stylus is applied to a touch-sensitive display screen of a computer (e.g., the Apple iPad device). The other variety is tablets in which a stylus is applied on a touch-sensitive pad that serves as a peripheral to a separate computer (e.g., the Wacom Intuos device). The latter device is sometimes termed a graphics tablet.

As is familiar to artisans, tablets repeatedly sense the X- and Y-locations of the tip of the stylus, allowing the path of the stylus to be tracked. A marking—such as a pen or pencil stroke—can thereby be formed on a display screen, and corresponding artwork data can be stored in a memory.

Various location-sensing technologies are used. So-called “passive” systems (as typified by many Wacom devices) employ electromagnetic induction technology, where horizontal and vertical wires in the tablet operate as both transmitting and receiving coils. The tablet generates an electromagnetic signal, which is received by an inductor-capacitor (LC) circuit in the stylus. The wires in the tablet then change to a receiving mode and read the signal generated by the stylus. Modern arrangements also provide pressure sensitivity and one or more buttons, with the electronics for this information present in the stylus. On older tablets, changing the pressure on the stylus nib or pressing a button changed the properties of the LC circuit, affecting the signal generated by the stylus, while modern ones often encode this information into the signal as a digital data stream. By using electromagnetic signals, the tablet is able to sense the stylus position without the stylus having to even touch the surface, and powering the stylus with this signal means that devices used with the tablet never need batteries.

Active tablets differ in that the stylus contains self-powered electronics that generate and transmit a signal to the tablet. These styluses rely on an internal battery rather than the tablet for their power, resulting in a more complex stylus. However, eliminating the need to power the stylus from the tablet means that the tablet can listen for stylus signals constantly, as it does not have to alternate between transmit and receive modes. This can result in less jitter.

Many tablets now employ capacitive sensing, in which electric coupling between electrodes within the tablet varies in accordance with the presence of an object—other than air—adjacent the electrodes. There are two types of capacitive sensing system: mutual capacitance, where an object (finger, stylus) alters the mutual coupling between row and column electrodes (which are scanned sequentially); and self- or absolute capacitance, where the object (such as a finger) loads the sensor or increases the parasitic capacitance to ground. Most smartphone touch screens are based on capacitive sensing.

Then there are some styluses that don't require a tablet—those that rely on optical sensing. These devices are equipped with small cameras that image features on a substrate, by which movement of the stylus can be tracked. Motion deduced from the camera data is sent to an associated computer device. The Anoto pen is an example of such a stylus.

The pressure with which the stylus is urged against the virtual writing surface can be sensed by various means, including sensors in the stylus itself (e.g., a piezo-electric strain gauge), and sensors in the tablet surface (e.g., sensing deflection by a change in capacitance). Examples are taught in patent documents 20090256817, 20120306766, 20130229350, 20140253522, and references cited therein.

A variety of other stylus/tablet arrangements are known but are not belabored here; all can be used with the technology detailed herein.

In use, a computer device monitors movements of the stylus, and writes corresponding information to a memory. This information details the locations traversed by the stylus, and may also include information about the pressure applied by the stylus. The location information may be a listing of every {X,Y} point traversed by the stylus, or only certain points may be stored—with the computer filling-in the intervening locations by known vector graphics techniques. Each stroke may be stored as a distinct data object, permitting the creator to “un-do” different strokes, or to format different strokes differently.

Each stroke is commonly associated with a tool, such as a virtual pen, pencil or brush. (The term “brush” is used in this description to refer to all such tools.) The tool defines characteristics by which the stroke is rendered for display, e.g., whether as a shape that is filled with a fully saturated color (e.g., as is commonly done with pen tools), or as a shape that is filled with a pattern that varies in luminance, and sometimes chrominance, across its extent (e.g., as is commonly done with pencil tools). When pressure is sensed, the pressure data may be used to vary the width of the stroke, or to increase the darkness or saturation or opacity of the pattern laid down by the stroke.

Users commonly think of their stylus strokes as applying digital “ink” to a virtual “canvas.” Different “layers” may be formed—one on top of the other—by instructions that the user issues through a graphical user interface. Alternatively, different layers may be formed automatically—each time the user begins a new stroke. The ink patterns in the different layers may be rendered in an opaque fashion—occluding patterns in any lower layer, or may be only partially opaque (transparent)—allowing a lower layer to partially show-through. (As noted, opacity may be varied based on stylus pressure. Additionally, going over a stroke a second time with the same tool may serve to increase its opacity.)

In practical implementation, all of the input information is written to a common memory, with different elements tagged with different attributes. The present description usually adopts the user's view—speaking of canvas and layers, rather than the well-known memory constructs by which computer graphic data are stored. The mapping from “canvas” to “memory” is straightforward to the artisan. Transparency is commonly implemented by a dedicated “alpha” channel in the image representation. (For example, the image representation may comprise red, green, blue and alpha channels.)

FIGS. 43 and 44 are tiles of illustrative signal-bearing patterns. These are greatly magnified for purposes of reproduction; the actual patterns have a much finer scale. Also, the actual patterns are continuous—the right edge seamlessly continues the pattern on the left edge, and similarly with the top and bottom edges. As described earlier, FIG. 43 may be regarded as a continuous pattern—comprised of element values (e.g., pixels) that vary through a range of values. FIG. 44 may be regarded as a sparse pattern.

FIG. 43 also greatly exaggerates the contrast of the pattern for purposes of illustration. The depicted tile has a mean value (amplitude) of 128, on a 0-255 scale, and some pixels have values near 0 and others have values near 255. More typically the variance, or standard deviation, from the mean is much smaller, such as 5 or 15 digital numbers. Such patterns commonly have an apparently uniform tonal value, with a mottling, or salt-and-pepper noise, effect that may be discerned only on close human inspection.

A first method for human-authoring of signal-bearing content is to tile blocks of a desired signal-carrying pattern, edge-to-edge, in a layer, to create a pattern coextensive with the size of the user's canvas. A second layer, e.g., of solid, opaque, white, is then applied on top of the first. This second layer is all that is visible to the user, and serves as the canvas layer on which the user works. The tool employed by the user to author signal-bearing content does not apply pattern (e.g., virtual ink, or pixels) to this second layer, but rather erases from it—revealing portions of the patterned first layer below.

Technically, the erasing in this arrangement is implemented as changing the transparency of the second layer—allowing excerpts of the underlying first layer to become visible. The tool used can be a pen, pencil, brush, or the like. The tool may change the transparency of every pixel within its stroke boundary to 100% (or, put another way, may change the opacity of every pixel to 0%). Alternatively, the brush may change the transparency differently at different places within the area of the stroke, e.g., across the profile of the brush.

FIG. 45 shows the former case—the brush changes the transparency of every pixel in its path from 0% to 100%, resulting in a pattern with sharp edges. FIG. 46 shows the latter case—different regions in the brush change the transparency differently—a little in some places, a lot in others. (In so-doing, the brush changes the mean value, and the variance, of the underlying pattern as rendered on the canvas.) Such a brush adds texture.

The texture added by a brush commonly has an orientation that is dependent on the direction the brush is used—just as filaments of a physical brush leave fine lines of texture in their wake. The signal-carrying pattern revealed by the brush has a texture, too. But the signal-carrying pattern texture always has a fixed spatial pose—in rotation, scale and position. It does not matter in which direction the brush is applied; the spatial pose of the revealed pattern is constant.

In some embodiments, repeated brush strokes across a region successively increase the transparency of the top layer, revealing the underlying signal-carrying pattern to successively darker degrees. A first brush stroke may reveal the pattern by applying a 20% transparency to the covering second layer. Such a pattern has a light tone; a high mean value (and a small variance). A second brush stroke over the same area may increase the transparency to 40%, darkening the tone a bit, and increasing the variance. And then 60%, and 80%, until finally the covering layer is 100% transparent, revealing the underlying pattern with its original mean and variance. With each increase in transparency, the contrast of the rendered, revealed signal pattern is increased.
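
A minimal sketch of this erase-to-reveal behavior follows; the stroke mask, transparency step, and canvas dimensions are placeholders:

```python
# Sketch only: the canvas is an opaque white layer over a tiled signal-carrying
# pattern; each stroke raises the local transparency (alpha) of the cover, so
# repeated strokes reveal the pattern with progressively greater contrast.
import numpy as np

rng = np.random.default_rng(seed=1)
pattern = rng.normal(128, 15, size=(1024, 1280))   # stand-in signal-carrying layer
alpha = np.zeros_like(pattern)                     # 0 = fully covered by white

def apply_stroke(alpha, stroke_mask, step=0.2):
    alpha[stroke_mask] = np.minimum(alpha[stroke_mask] + step, 1.0)
    return alpha

def render(pattern, alpha):
    return alpha * pattern + (1.0 - alpha) * 255.0  # composite pattern over white

stroke = np.zeros(pattern.shape, dtype=bool)
stroke[100:140, 100:400] = True                    # a crude rectangular "stroke"
alpha = apply_stroke(alpha, stroke)                # 20% revealed
alpha = apply_stroke(alpha, stroke)                # second pass: 40% revealed
canvas = render(pattern, alpha)
```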

This is schematically illustrated by FIGS. 47 and 48. FIG. 47 conceptually illustrates how a prior art brush applies marks to a canvas—with a texture whose orientation is dependent on the directions of the strokes (shown by arrows). Where strokes overlap, a darker marking is rendered—since more virtual ink is applied in the region of overlap. FIG. 48 is a counterpart illustration for this aspect of the present technology. The orientation of the pattern does not depend on the direction of the brush stroke. Where strokes cross, the underlying pattern is rendered more darkly, with greater contrast.

The degree of transparency of a stroke can also be varied by changing the pressure applied to the stylus. At one pressure, a stroke may increase the transparency of the covering layer by 70% (e.g., from 0% to 70%). At a second, higher, pressure, a stroke may change the transparency of the covering layer by 100% (from 0% to 100%). Many intermediate values are also possible.

Such arrangement is shown in FIG. 49A. The digit ‘5’ is written at one pressure, and the digit ‘7’ is written with a lighter pressure. The latter stroke is rendered with less transparency (pixels within the boundaries of the ‘5’ stroke are rendered at 100% transparency, whereas the pixels within the ‘7’ are rendered at a lesser transparency). The mean value of the pixels within the boundary of the ‘5’ in FIG. 49A matches the mean value of the corresponding pixels in the underlying signal-carrying pattern. In contrast, the pixels within the boundary of the ‘7’ have a mean value higher than that of the corresponding pixels in the underlying pattern.

(As noted, the contrast is greatly exaggerated in most of the images; FIG. 49B shows handwritten digits with contrast of a more typical value. FIG. 49C shows the handwritten digits with a more typical contrast and size.)

While the just-described arrangement employs an underlying, signal-conveying layer that is coextensive in size with the top layer, this is not necessary. Since the pattern is repeated in a tiled arrangement, the system memory may store just a single tile's worth of pattern data. Such a tile may have dimensions of 128×128 (e.g., pixels), while the canvas may have dimensions of 1280×1024. If the user picks a signaling brush (i.e., one that reveals the signal-carrying pattern), and draws a brush stroke starting from canvas X-, Y-coordinates {10,100}, and continuing to coordinates {10,200} (and encompassing a surrounding region dependent on a profile of the brush), the system effects a modulo operation to provide pattern when the original tile “runs-out” of data (i.e., after coordinate {10,128}). That is, for the start of the stroke, the erasing operation reveals the signal pattern between coordinates {10,100} and {10,128}. Once it reaches value 128 (in either dimension), it performs a mod-128 operation and continues. Thus, the pattern that is revealed at coordinates {10,130} is read from tile coordinates {10,2}, and so forth.
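
In code, such a modulo lookup might resemble the sketch below (the {X,Y} convention and 128-element tile follow the example above):

```python
# Sketch only: a single tile supplies pattern values for any canvas coordinate,
# wrapping when the tile "runs out" of data.
def pattern_value(tile, x, y, tile_size=128):
    return tile[y % tile_size, x % tile_size]

# e.g., canvas coordinate {10,130} reads from tile coordinate {10,2}:
# value = pattern_value(tile, 10, 130)
```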

While the foregoing description was based on erasing a blank layer to reveal a signal-carrying layer beneath, more typically a different approach is used. That is, a brush applies digital ink to an artwork layer in a pattern conveying the plural-bit payload. The pattern is two-dimensional and stationary, with an anchor point (e.g., at the upper left corner of the canvas) to which the pattern is spatially related (i.e., establishing what part of the pattern is to be laid-down at what part of the canvas).

A simple implementation can be achieved by using the capability to pattern-draw that is built into certain graphics tools, like Photoshop. In that software, a tile of signal-conveying pattern can be imported as an image, selected, and then defined as a Photoshop pattern (by Edit/Define Pattern). The user then paints with this pattern by selecting the Pattern Stamp tool, selecting the just-defined pattern from the pattern menu, and choosing a brush from the Brush Presets panel. By selecting the “Aligned” option, the pattern is aligned from one paint stroke to the next. (If Aligned is deselected, the pattern is centered on the stylus location each time a new stroke is begun.)

Instead of modulating transparency, software may be written so that stylus pressure (or stroke over-writing) modulates the mean value of the payload-carrying pattern: darker pattern (i.e., lower mean values) is deposited with more pressure; lighter pattern (higher mean values) is deposited with less pressure. The signal-carrying strength of the pattern (i.e., its variance) can be set as a menu parameter of the brush with which the pattern is applied.

The signal-carrying pattern typically includes both plural-bit payload information, and also a synchronization signal. The payload information may include data identifying the user, the date, the GPS-indicated time, a product GTIN, or other metadata specified by the user. This information may literally be encoded in the payload, or the payload may be a generally-unique ID that serves as a pointer into a local or remote database where corresponding metadata is stored. As is familiar from the prior art (including U.S. Pat. No. 6,590,996), the payload is error-correction encoded, randomized, and expressed as +/−“chips” that serve to increase or decrease parameters (e.g., luminance or chrominance) of respective pixels in a reference signal tile. The synchronization signal can comprise an ensemble of multiple spatial domain sinusoids of different frequencies (and, optionally, of different phases and amplitudes), which can be specified succinctly by parameters in a frequency domain representation. A Fourier transform can then be applied to produce a corresponding spatial domain representation of any dimension. The resulting sinusoids are coextensive with the reference signal tile, and are added to it, further adjusting pixel values.

In some embodiments, the software enables the brushes to be configured to apply signal-carrying patterns having different means and variances, as best suits the user's requirements. To introduce highlights into an artwork, a pattern having a light tone, such as a mean digital value of 200, can be employed. To introduce shadows, a pattern having a dark tone, such as a mean value of 50, can be employed. The variance, too, may be user selectable—indicating the degree of visible mottling desired by the content creator. As is familiar from graphics programs such as Photoshop, the user may invoke software to present a user-interface by which such specific parameters of the brush can be specified. The software program responds by generating a reference signal tile (or a needed excerpt) having desired mean and variance values.

One particular approach to enabling such variation in mean amplitude and variance is to store reference data for the tile as an array of real-valued numbers, ranging from −1 to +1. (Such an array can be produced by summing a +/−chip, and a synchronization signal value, for each element in a tile, and then scaling the resultant array to yield the −1 to +1 data array.) This reference data can then be multiplied by a user-selected variance (e.g., 15, yielding values between −15 and +15), and then summed with a user-specified mean value (e.g., 200, yielding values between 185 and 215) to generate a reference tile with the desired variance and mean.

The variance or the mean, or both, may be modulated in accordance with the stylus pressure. If the pressure increases, the darkness can be increased by reducing the mean value. (E.g., a nominal pressure may correspond to a mean value of 128; greater pressures may cause this value to reduce—ultimately to 30; lesser pressures may cause this value to increase—ultimately to 220.)
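
A sketch of this mean/variance scaling follows, with stylus pressure mapped onto the mean value; the specific endpoint values track the examples just given, but the linear mapping itself is an assumption:

```python
# Sketch only: scale a [-1, +1] reference tile to a user-selected variance and
# mean, with the mean optionally driven by stylus pressure.
import numpy as np

def render_tile(reference, variance=15, mean=128):
    # "reference" holds real values in the range -1 to +1
    return np.clip(reference * variance + mean, 0, 255)

def mean_from_pressure(pressure):
    # pressure in [0, 1]; 0.5 is nominal (mean 128), 1.0 darkest (30), 0.0 lightest (220)
    return np.interp(pressure, [0.0, 0.5, 1.0], [220.0, 128.0, 30.0])

rng = np.random.default_rng(seed=9)
reference = rng.uniform(-1, 1, size=(128, 128))                      # stand-in reference tile
dark_patch = render_tile(reference, mean=mean_from_pressure(0.9))    # firm stroke renders darker
```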

The spatial scale of the signaling pattern can also be varied, e.g., by specifying a reference signal tile that conveys its information at a specific spatial scale. For example, a tile that is usually 128×128 pixels may instead be specified as 256×256 pixels, causing the spatial scale to double (which halves the spatial frequency of the signaling components). Again, the user can set such parameter to whatever value gives a desired visual effect. (The reference data can be generated accordingly, e.g., by spreading the payload chips over a 256×256 data array, and summing with a spatial synchronization signal that has been transformed from its frequency domain representation to a spatial signal having a 256×256 scale.) Typically, a smaller scale is used, so that the payload can be recovered from a smaller excerpt of pattern-based art.

Some drawing applications associate a (virtual) physical texture with the canvas. This causes certain tools to behave differently: the virtual ink deposited depends not just on the tool configuration, but also on the surface microtopology where the ink is applied. At points where the surface microtopology has a local maximum, more ink is deposited (being analogous to more pressure being applied—by the surface more firmly engaging the tool at such locations). At valleys between such maxima, less ink is deposited.

So, too, with embodiments of the present technology. A texture descriptor associated with the canvas serves to similarly modulate the contrast with which the signal-carrying pattern is rendered on the canvas. The pattern is rendered more darkly at locations where the texture has higher local peaks. As the user applies more pressure to the stylus, more of the valleys between the peaks are filled-in with pattern (or are filled-in with darker pattern).

Although the depicted examples use a continuous-tone monochrome pattern consisting of just signal (e.g., the tile of FIG. 43), in many applications other signal-carrying patterns are used. Exemplary are the signal-carrying patterns produced using the neural network techniques described above. FIGS. 50A and 50B show stylus strokes with exemplary neural network-produced patterning. Patterns based on natural imagery (e.g., the rice grains and water droplets of FIGS. 15A and 15B) and synthetic imagery (e.g., bitonal geometrical patterns of the sort used in FIGS. 8-10) can both be used.

FIG. 51 shows a prior art user interface 510 of a tablet drawing app (Sketchbook by Autodesk, Inc.). In addition to a drawing area 512, the interface includes a tools menu 514, a properties panel 516, and a layers guide 518. The user can pick up a desired tool (e.g., brush, pencil, etc.) by tapping the stylus on a corresponding icon in the tools menu 514. The properties panel 516 can be opened to customize the parameters of the tool. The layers guide 518 enables the user to select different layers for editing.

In some embodiments of the present technology, one or more tools are dedicated to the purpose of drawing with a signal-carrying pattern, as detailed above. These tools may be so denoted by including graphical indicia within the icon, such as the binary bits “101,” thereby clueing-in the user to their data encoding functionality. When a user selects such a tool from the tools menu 514, selections can be made from the properties panel 516 to define signal-customization parameters, such as the pattern to be used (e.g., the patterns of FIGS. 1, 11, 15A, 44, etc.), color, mean value, variance, and scale—in addition to the parameters usually associated with prior art tools.

In other embodiments, no tools are dedicated to applying signal-carrying patterns. Instead, all tools have this capability. This capability can be invoked, for a particular tool, by making a corresponding selection in that tool's properties panel. Again, signal-customization options such as mean value, variance, and scale can be presented.

In still other embodiments, all tools apply signal-carrying patterns. The properties panel for each tool includes options by which such signal can be customized.

To review, certain embodiments according to these aspects of the technology concern a method of generating user-authored graphical content. Such method makes use of a hardware system including a processor and a memory, and includes: receiving authoring instructions from a user, the instructions taking the form of plural strokes applied to a virtual canvas by a virtual tool; and responsive to said instructions, rendering a first signal-carrying pattern on the canvas in a first area included within a first stroke. Such arrangement is characterized in that (1) the content conveys a plural-bit digital payload encoded by said signal-carrying pattern; and (2) the signal-carrying pattern was earlier derived, using a neural network, from an image depicting a natural or synthetic pattern.

Another method is similar, but is characterized in that (1) the graphical content conveys a plural-bit digital payload encoded by said signal-carrying pattern; (2) the first signal-carrying pattern is rendered with a first mean value or contrast, due to a pressure with which the user applies the physical stylus to a substrate when making the first stroke; and (3) the second signal-carrying pattern is rendered with a second mean value or contrast due to a pressure with which the user applies the physical stylus to the substrate when making the second stroke, where the second value is different than the first.

Another method is also similar, but involves—responsive to input from the user's virtual tool—rendering a first signal-carrying pattern on the canvas in a first area included within a first stroke, and rendering a second signal-carrying pattern on the canvas in a second area included within a second stroke, where the rendered first and second patterns both correspond to a common reference pattern stored in the memory. Such arrangement is characterized in that (1) the graphical content conveys a plural-bit digital payload encoded by patterns defined by the plural strokes; and (2) elements of the first pattern, rendered within the first area, have a variance or mean amplitude that differs from a variance or mean amplitude, respectively, of spatially-corresponding elements of the reference pattern.

An example of this latter arrangement is illustrated by FIG. 52. Elements of the “7” on the right side of the figure (e.g., within the small box), have a mean amplitude that is different than that of spatially-corresponding elements of the reference pattern (e.g., within the corresponding small box in the center of the figure).

Graphical content produced by the described arrangements can be printed and scanned, or imaged by a camera-equipped device (like a smartphone) from on-screen or a print rendering, to produce an image from which the payload can be extracted, using techniques detailed in the cited art. (The image format is irrelevant—data can be extracted from TIF, JPG, PDF, etc., data.) This enables a great number of applications, including authoring artwork for product packaging that encodes a UPC code or that links to product information, communicating copyright, indicating irrefutable authorship, and authenticating content (e.g., by use of digital signature conveyed by the pattern). It also helps bridge the analog/digital divide, by enabling handwritten notes—on tablets and electronic whiteboards—to be electronically stored and searched, using metadata that is inseparably bound with the notes.

CONCLUDING REMARKS

Having described and illustrated our technology with reference to exemplary embodiments, it should be recognized that our technology is not so limited.

For example, while described in the context of generating patterns for use in printing retail packaging, it should be recognized that the technology finds other applications as well, such as in printed security documents.

Moreover, printing is not required. The patterns produced by the present technology can be displayed on, and read from, digital displays—such as smartphones, digital watches, electronic signboards, tablets, etc.

Similarly, product surfaces may be textured with such patterns (e.g., by injection molding or laser ablation) to render them machine-readable.

All of the patterns disclosed herein can be inverted (black/white), colored, rotated and scaled—as best fits particular contexts.

It should be understood that the watermarks referenced above include two components: a reference or synchronization signal (enabling geometric registration), and a payload signal (conveying plural symbols of information—such as a GTIN). It will be noted that each of the depicted signal-carrying patterns can be detected by the Digimarc app for the iPhone or Android smartphones.

While the detailed embodiments involving continuous tone watermarks usually focused on luminance watermarks, it should be recognized that the same principles extend straightforwardly to chrominance watermarks.

Although described in the context of imagery, the same principles can likewise be applied to audio—taking a pseudo-random payload-conveying signal, and imparting to it a structured, patterned audio aesthetic that renders it more sonically pleasing.

The VGG neural network of FIG. 4 is preferred, but other neural networks can also be used. Another suitable network is the “AlexNet” network, e.g., as implemented in Babenko, et al, Neural Codes for Image Retrieval, arXiv preprint arXiv:1404.1777 (2014).

Naturally, the number and sizes of the layers and stacks can be varied as particular applications may indicate.

Additional code adapted by applicant for use in the detailed embodiments is publicly available at the github repository, at the web address github<dot>com/llSourcell/How_to_do_style_transfer_in_tensorflow/blob/master/Style_Transfer.ipynb. A printout of those materials is attached to application 62/596,730.

In the first embodiment, rather than start with a random image as the test image, the watermark image or the geometrical pattern image is instead used as the starting point, and is then morphed by the detailed methods towards the other image.

While the detailed embodiments draw primarily from work by Gatys and Johnson regarding neural network-based style transfer, it will be recognized that there are other such neural network-based techniques—any of which can similarly be used as detailed above. Examples include Li et al, Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis, Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 2479-2486; Chen, et al, Stylebank: An explicit representation for neural image style transfer, CVPR 2017, p. 4; Luan, et al, Deep Photo Style Transfer, CVPR 2017, pp. 6997-7005; and Huang, et al, Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization, 2017 Intl Conference on Computer Vision, pp. 1510-1519. Others are detailed in U.S. Pat. No. 9,922,432, and in patent publications 20180158224, 20180082715, and 20180082407. Again, such techniques are familiar to artisans in the style transfer art.

Various examples of watermark encoding protocols and processing stages of these protocols are detailed in Applicant's prior work, such as our U.S. Pat. Nos. 6,614,914, 5,862,260, and 6,674,876, and US patent publications 20100150434 and 20160275639—all of which are incorporated herein by reference. More information on signaling protocols, and schemes for managing compatibility among protocols, is provided in U.S. Pat. No. 7,412,072, which is hereby incorporated by reference.

An increasing number of toolsets have been designed specifically for working with neural networks. Caffe is one—an open source framework for deep learning algorithms, distributed by the Berkeley Vision and Learning Center. Another is Google's TensorFlow.

In certain implementations, applicant uses a computer equipped with multiple Nvidia Titan X GPU cards. Each card includes 3,584 CUDA cores and 12 GB of fast GDDR5X memory.

Alternatively, the neural networks can be implemented in a variety of other hardware structures, such as a microprocessor, an ASIC (Application Specific Integrated Circuit), or an FPGA (Field Programmable Gate Array). Hybrids of such arrangements can also be employed, such as reconfigurable hardware and ASIPs.

By microprocessor, Applicant means a particular structure, namely a multipurpose, clock-driven, integrated circuit that includes both integer and floating point arithmetic logic units (ALUs), control logic, a collection of registers, and scratchpad memory (aka cache memory), linked by fixed bus interconnects. The control logic fetches instruction codes from a memory (often external), and initiates a sequence of operations required for the ALUs to carry out the instruction code. The instruction codes are drawn from a limited vocabulary of instructions, which may be regarded as the microprocessor's native instruction set.

A particular implementation of one of the above-detailed algorithms on a microprocessor—such as the conversion of an iterated test image into a line art pattern—can begin by defining the sequence of operations in a high level computer language, such as MatLab or C++ (sometimes termed source code), and then using a commercially available compiler (such as the Intel C++ compiler) to generate machine code (i.e., instructions in the native instruction set, sometimes termed object code) from the source code. (Both the source code and the machine code are regarded as software instructions herein.) The method is then performed by instructing the microprocessor to execute the compiled code.

Many microprocessors are now amalgamations of several simpler microprocessors (termed “cores”). Such arrangements allow multiple operations to be executed in parallel. (Some elements—such as the bus structure and cache memory—may be shared between the cores.)

Examples of microprocessor structures include the Intel Xeon, Atom and Core-I series of devices. They are attractive choices in many applications because they are off-the-shelf components. Implementation need not wait for custom design/fabrication.

Closely related to microprocessors are GPUs (Graphics Processing Units). GPUs are similar to microprocessors in that they include ALUs, control logic, registers, cache, and fixed bus interconnects. However, the native instruction sets of GPUs are commonly optimized for image/video processing tasks, such as moving large blocks of data to and from memory, and performing identical operations simultaneously on multiple sets of data (e.g., pixels or pixel blocks). Other specialized tasks, such as rotating and translating arrays of vertex data into different coordinate systems, and interpolation, are also generally supported. The leading vendors of GPU hardware include Nvidia, ATI/AMD, and Intel. As used herein, Applicant intends references to microprocessors to also encompass GPUs.

GPUs are attractive structural choices for execution of the detailed arrangements, due to the nature of the data being processed, and the opportunities for parallelism.

While microprocessors can be reprogrammed, by suitable software, to perform a variety of different algorithms, ASICs cannot. While a particular Intel microprocessor might be programmed today to serve as a deep neural network, and programmed tomorrow to prepare a user's tax return, an ASIC structure does not have this flexibility. Rather, an ASIC is designed and fabricated to serve a dedicated task, or limited set of tasks. It is purpose-built.

An ASIC structure comprises an array of circuitry that is custom-designed to perform a particular function. There are two general classes: gate array (sometimes termed semi-custom), and full-custom. In the former, the hardware comprises a regular array of (typically) millions of digital logic gates (e.g., XOR and/or AND gates), fabricated in diffusion layers and spread across a silicon substrate. Metallization layers, defining a custom interconnect, are then applied—permanently linking certain of the gates in a fixed topology. (A consequence of this hardware structure is that many of the fabricated gates—commonly a majority—are typically left unused.)

In full-custom ASICs, however, the arrangement of gates is custom-designed to serve the intended purpose (e.g., to perform a specified function). The custom design makes more efficient use of the available substrate space—allowing shorter signal paths and higher speed performance. Full-custom ASICs can also be fabricated to include analog components, and other circuits.

Generally speaking, ASIC-based implementations of the detailed arrangements offer higher performance, and consume less power, than implementations employing microprocessors. A drawback, however, is the significant time and expense required to design and fabricate circuitry that is tailor-made for one particular application.

An ASIC-based implementation of one of the above arrangements again can begin by defining the sequence of algorithm operations in a source code, such as MatLab or C++. However, instead of compiling to the native instruction set of a multipurpose microprocessor, the source code is compiled to a “hardware description language,” such as VHDL (an IEEE standard), using a compiler such as HDL Coder (available from MathWorks). The VHDL output is then applied to a hardware synthesis program, such as Design Compiler by Synopsys, HDL Designer by Mentor Graphics, or Encounter RTL Compiler by Cadence Design Systems. The hardware synthesis program provides output data specifying a particular array of electronic logic gates that will realize the technology in hardware form, as a special-purpose machine dedicated to such purpose. This output data is then provided to a semiconductor fabrication contractor, which uses it to produce the customized silicon part. (Suitable contractors include TSMC, Global Foundries, and ON Semiconductor.)

A third hardware structure that can be used to implement the above-detailed arrangements is an FPGA. An FPGA is a cousin to the semi-custom gate array discussed above. However, instead of using metallization layers to define a fixed interconnect between a generic array of gates, the interconnect is defined by a network of switches that can be electrically configured (and reconfigured) to be either on or off. The configuration data is stored in, and read from, a memory (which may be external). By such arrangement, the linking of the logic gates—and thus the functionality of the circuit—can be changed at will, by loading different configuration instructions from the memory, which reconfigure how these interconnect switches are set.

FPGAs also differ from semi-custom gate arrays in that they commonly do not consist wholly of simple gates. Instead, FPGAs can include some logic elements configured to perform complex combinational functions. Also, memory elements (e.g., flip-flops, but more typically complete blocks of RAM memory) can be included. Again, the reconfigurable interconnect that characterizes FPGAs enables such additional elements to be incorporated at desired locations within a larger circuit.

Examples of FPGA structures include the Stratix FPGA from Altera (now Intel), and the Spartan FPGA from Xilinx.

As with the other hardware structures, implementation of the above-detailed arrangements begins by specifying a set of operations in a high level language. And, as with the ASIC implementation, the high level language is next compiled into VHDL. But then the interconnect configuration instructions are generated from the VHDL by a software tool specific to the family of FPGA being used (e.g., Stratix/Spartan).

Hybrids of the foregoing structures can also be used to implement the detailed arrangements. One structure employs a microprocessor that is integrated on a substrate as a component of an ASIC. Such arrangement is termed a System on a Chip (SOC). Similarly, a microprocessor can be among the elements available for reconfigurable interconnection with other elements in an FPGA. Such arrangement may be termed a System on a Programmable Chip (SOPC).

Another hybrid approach, termed reconfigurable hardware by the Applicant, employs one or more ASIC elements. However, certain aspects of the ASIC operation can be reconfigured by parameters stored in one or more memories. For example, the weights of convolution kernels can be defined by parameters stored in a re-writable memory. By such arrangement, the same ASIC may be incorporated into two disparate devices, which employ different convolution kernels. One may be a device that employs a neural network to recognize grocery items. Another may be a device that morphs a watermark pattern so as to take on attributes of a desired geometrical pattern, as detailed above. The chips are all identically produced in a single semiconductor fab, but are differentiated in their end-use by different kernel data stored in memory (which may be on-chip or off).
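
The concept can be sketched in software terms as follows; the file names, array shapes, and convolution routine are hypothetical, and stand in for kernel parameters held in a re-writable memory of an otherwise fixed hardware engine.

    # Software analogy of the reconfigurable-hardware idea; file names and
    # shapes are hypothetical.
    import numpy as np
    from scipy.signal import convolve2d

    def conv_engine(image, kernel_file):
        kernels = np.load(kernel_file)  # the re-writable "memory" of weights
        return [convolve2d(image, k, mode="same") for k in kernels]

    # The identical engine, configured for two different end uses:
    # responses_a = conv_engine(img, "grocery_recognition_kernels.npy")
    # responses_b = conv_engine(img, "pattern_styling_kernels.npy")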

Yet another hybrid approach employs application-specific instruction set processors (ASIPs). ASIPs can be thought of as microprocessors. However, instead of having multipurpose native instruction sets, the instruction set is tailored—in the design stage, prior to fabrication—to a particular intended use. Thus, an ASIP may be designed to include native instructions that serve operations associated with some or all of: convolution, pooling, ReLU, etc. However, such a native instruction set would lack certain of the instructions available in more general purpose microprocessors.

Reconfigurable hardware and ASIP arrangements are further detailed in U.S. Pat. No. 9,819,950, the disclosure of which is incorporated herein by reference.

Processing hardware suitable for neural networks is also widely available in “the cloud,” such as the Azure service by Microsoft Corp. and Cloud AI by Google.

In addition to the toolsets developed especially for neural networks, familiar image processing libraries such as OpenCV can be employed to perform many of the methods detailed in this specification. Software instructions for implementing the detailed functionality can also be authored by the artisan in C, C++, MatLab, Visual Basic, Java, Python, Tcl, Perl, Scheme, Ruby, etc., based on the descriptions provided herein.
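
By way of example only, OpenCV can reduce a grayscale stylized pattern to a bitonal, line-art-like rendering; the blur, threshold, and edge parameters below, and the file name, are illustrative assumptions rather than values from this specification.

    # Illustrative only: parameters and file name are assumptions.
    import cv2

    def to_line_art(gray_img, low=60, high=160):
        # Reduce a grayscale stylized pattern to a bitonal, line-art-like rendering.
        blurred = cv2.GaussianBlur(gray_img, (5, 5), 0)
        edges = cv2.Canny(blurred, low, high)   # white edge lines on black
        return cv2.bitwise_not(edges)           # black lines on white

    # pattern = to_line_art(cv2.imread("stylized_tile.png", cv2.IMREAD_GRAYSCALE))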

Software and hardware configuration data/instructions are commonly stored as instructions in one or more data structures conveyed by tangible media, such as magnetic or optical discs, memory cards, ROM, etc., which may be accessed across a network.

Some of applicant's other work involving neural networks is detailed in patent application Ser. No. 15/726,290, filed Oct. 5, 2017, Ser. No. 15/059,690, filed Mar. 3, 2016 (now U.S. Pat. No. 9,892,301), Ser. No. 15/149,477, filed May 9, 2016, Ser. No. 15/255,114, filed Sep. 1, 2016 (now U.S. Pat. No. 10,042,038), and in published applications 20150030201, 20150055855 and 20160187199.

Additional information about the retail environments in which the encoded signals are utilized is provided in published application 20170249491, and in pending application Ser. No. 15/851,298, filed Dec. 21, 2017.

This specification has discussed various arrangements. It should be understood that the methods, elements and features detailed in connection with one arrangement can be combined with the methods, elements and features detailed in connection with other arrangements.

In addition to the prior art works identified above, other documents from which useful techniques can be drawn include:

  • Goodfellow, et al, Generative adversarial nets, Advances in Neural Information Processing Systems, 2014 (pp. 2672-2680);
  • Ulyanov, et al, Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022, Jul. 27, 2016;
  • Zhu, et al, Unpaired image-to-image translation using cycle-consistent adversarial networks. arXiv preprint arXiv:1703.10593, Mar. 30, 2017.

Moreover, it will be recognized that the detailed technology can be included with other technologies—current and upcoming—to advantageous effect. Implementation of such combinations should be straightforward to the artisan from the teachings provided in this disclosure.

While this disclosure has detailed particular ordering of acts and particular combinations of elements, it will be recognized that other contemplated methods may re-order acts (possibly omitting some and adding others), and other contemplated combinations may omit some elements and add others, etc.

Although disclosed as complete systems, sub-combinations of the detailed arrangements are also separately contemplated (e.g., omitting various of the features of a complete system).

While certain aspects of the technology have been described by reference to illustrative methods, it will be recognized that apparatuses configured to perform the acts of such methods are also contemplated as part of Applicant's inventive work. Likewise, other aspects have been described by reference to illustrative apparatus, and the methodology performed by such apparatus is likewise within the scope of the present technology. Still further, tangible computer readable media containing instructions for configuring a processor or other programmable system to perform such methods is also expressly contemplated.

To provide a comprehensive disclosure, while complying with the Patent Act's requirement of conciseness, Applicant incorporates by reference each of the documents referenced herein. (Such materials are incorporated in their entireties, even if cited above in connection with specific of their teachings.) These references disclose technologies and teachings that Applicant intends to be incorporated into the arrangements detailed herein, and into which the presently-detailed technologies and teachings can likewise be incorporated.

Claims

1. A method of generating artwork for labeling of a food package, comprising the acts:

receiving imagery that encodes a plural-bit payload therein;
neural network processing the imagery to reduce a difference measure between the processed imagery and a style image, yielding an output image; and
incorporating some or all of the output image in artwork for labeling of a food package;
wherein the plural-bit payload persists in the output image, enabling a compliant decoder module in a retail point of sale station to recover the plural-bit payload from an image of the food package.

2. The method of claim 1 in which the difference measure is based on a difference between a Gram matrix for the processed imagery and a Gram matrix for the style image.

3. The method of claim 1 in which the style image includes various features, wherein the neural network adapts features from the style image to express details of the received imagery, to thereby yield an output image in which features from the style image contribute to encoding of said plural-bit payload.

4. The method of claim 1 in which the style image depicts a multitude of items of a first type, wherein the neural network processing adapts scale, location, and rotation of said items as depicted in the output image to echo features of the received imagery, to thereby produce an output image depicting plural of said items in a configuration that encodes said plural-bit payload.

5-17. (canceled)

18. A food package having artwork printed thereon, the artwork encoding a payload including a Global Trade Item Number identifier, the artwork having first and second regions, the first region including a background pattern that has been stylized from a watermarked input image using a neural network.

19. The package of claim 18 in which the second region includes a watermarked image that has not been stylized using a neural network, wherein the watermark included in the second region is geometrically-aligned with a watermark signal included in the first region, wherein a detector can jointly use signals from both regions in decoding the payload.

20. The food package of claim 18 in which the first region includes a background pattern that has been stylized from a watermarked input image using a neural network, and then false-colored.

21-26. (canceled)

27. A method comprising the acts:

obtaining a watermark signal, the watermark signal encoding an identifier and being represented by watermark signal elements of a first size;
stylizing the watermark signal in accordance with a style image, the style image depicting a multitude of items of a first type; and
employing the stylized watermark signal in printing on an object;
wherein the stylized watermark signal enables the object to be identified by a camera-equipped apparatus; and
the items depicted in the style image have a size that is between 0.5 and 25 times said first size.

28. The method of claim 27 in which the watermark signal is a host image that has been encoded with a watermark payload.

29. The method of claim 27 in which the watermark signal is solitary, i.e., not encoded in a host image.

30-54. (canceled)

Patent History
Publication number: 20190213705
Type: Application
Filed: Dec 6, 2018
Publication Date: Jul 11, 2019
Inventors: Ajith M. Kamath (Beaverton, OR), Christopher A. Ambiel (Milwaukie, OR), Utkarsh Deshmukh (Hillsboro, OR), Andi R. Castle (Tigard, OR), Christopher M. Haverkate (Newberg, OR)
Application Number: 16/212,125
Classifications
International Classification: G06T 1/00 (20060101); G06N 3/08 (20060101); G06T 11/20 (20060101); G06K 19/06 (20060101);