Besides the legal points brought up, actually parsing the documents would be nearly impossible. Scientific papers are generally found on sci-hub as PDFs identical to the printed copy, not as marked up metadata. Parsing such things is non-trivial (as the poorly parsed epub versions of pdfs found on archive.org attests)