Modifying the Generalized Delta Rule to Train Networks of Non-monotonic Processors for Pattern Classification