On Evaluating Similarity Between Heterogeneous Data