Since my perl installation went horribly wrong-- it’s complaining about uninitialized values in join, I decided to go back to the King James Programming source; which seems less buggy. (It’s also in Python, which may please some of you).
But the corpuses I’m using seem rather dissimilar. HP Lovecraft writes of demons and doom, and so does the Bible, but he uses a more modern dialect of English, so shared stretches of three or four words are not very common. (you may correct me if I’m misunderstanding the algorithm).
I think kingjamesprogramming gets away with it because the bible is numbered, and so is the programming textbook.