What size network is good for generalization of a specific task of interest?