Difference between Layer and Batch normalization? Template matching, k-means, k-nn, naive bayesian classifier, SVM? Difference between RCNN, its variants and YOLO? Transformers and LSTMs? Why skip connections? Vanishing gradient problem? Random forest?