drop out, learning rate in nn
作者:互联网
- use different initial learning rates, says: 1e-3, 1e-4, 1e-5, if 1e-5 is the best one, that means your network is too complicate. you may want reduce to the layers.
标签:says,nn,748,drop,reduce,rate,1e,learning,文章 来源: https://blog.csdn.net/seamanj/article/details/103982094