This is illustrated by decision T 0702/20 of the Technical Board of Appeal 3.5.06. of the European Patent Office, which refused to grant a patent for a neural network with sparse connection density.
According to the Board, a neural network defines a class of mathematical functions which, as such, is excluded from patentability. Such a network cannot therefore be patented without at least implicitly indicating a specific use case.
The invention at issue in the decision concerns an adaptation of the connections between the layers of a neural network with a concept known as "loose couplings", which reduces the number of connections between the neurons of several layers of the network compared with conventional fully-connected architectures. These loose couplings are illustrated by the dotted lines in figure 2 of the patent application reproduced below.
Claim 1 relates to the ab initio architecture of the neural network, even before it is trained to perform a classification task. In particular, the loose connections between the neurons are established prior to training, independently of the training data.
Furthermore, claim 1 does not mention any specificity of the data used for training nor does it specify for what purpose the classification might be used.
The architecture of a neural network, therefore, determines the operations performed by the neurons and the way in which they are connected to each other. This architecture defines a class of mathematical functions. An instance of this class is given by the set of weights that result from training the neural network to perform a given task (for example, detecting objects in images).
It thus states that this claim merely specifies abstract mathematical operations implemented by computer on unspecified data, these mathematical operations being those of defining a class of approximation functions (the network with its structure), solving a (complex) system of (non-linear) equations to obtain the parameters of the functions (learning the weights), and using it to calculate outputs for which it is not known what task they are intended.
The Board concludes that the claimed invention is not excluded from patentability because it uses a computer but because it does not involve an inventive step.
In this case, the applicant had tried to justify that the claimed network avoided overfitting by citing a scientific article on data-driven loose coupling (in which the connectivity pattern is determined from the task to be performed, based on the distribution of training data). This article was of no help as, the Board noted that it did not relate to the claimed loose coupling where the pattern of connectivity is determined independently of the training data.
Perhaps a different conclusion would have been reached if the patent application had contained comparative examples of the claimed network with pre-existing networks, for example in terms of learning speed or accuracy of the classification obtained.
This is why we recommend that, wherever possible, such comparative examples should be included in patent applications relating to Artificial Intelligence. This is particularly important given that the question of whether post-filing published evidence should be taken into account when examining inventive step is presently debated before the Enlarged Board of Appeal with reference G 2/21.