Domanda di colloquio di Microsoft

How to measure distance between data point?

Risposte di colloquio

Anonimo

29 mar 2016

It depends on whether the data point contains numeric, categorical or mixed data, and also on the purpose of the distance function. For numeric, Euclidian is simple, but cosine is robust to scaling and dtw is good for time series. For categorical, a basic one will be Jaccard similarity, which local sensitive hashing might apply.

5

Anonimo

12 giu 2015

Euclidean distance, manhattan distance

1