Marcelo #34: don't know about the raw data, I read about the islands in some paper about Google PageRank algorithm. It is based in random walks on the web graph.