Page Compare

A simple heuristic for comparing the structural similarity of two HTML documents. This algorithm was also used for the clustering step in the construction of the Dark Web Map.