![]() ![]() Let's go with simple SGD: You need to set learning rate, momentum, decay. You can start with simple Stochastic Gradient Descent (SGD), but there are many others. Then you need to choose a training algorithm. How many neurons to use in each layer? What activation functions to use? What weights initialization to use?Īrchitecture ready. How many layers to use usually 2 or 3 layers should be enough. You need some magic skills to train NN well. For NN you have more steps for preprocessing, so more steps to implement in the production system as well.įor RF, you set the number of trees in the ensemble (which is quite easy because of the more trees in RF the better) and you can use default hyperparameters and it should work. Keep in mind that all preprocessing that is used for preparing training data should be used in production.
0 Comments
Leave a Reply. |