Russell Bateman last update:
Here's a Java library that converts HTML/CSS to PDF: https://github.com/danfickle/openhtmltopdf
Then there's Apache Tika, a content-analysis toolkit that can recognize a PDF file from its contents (rather than from its file name) and extract text and graphics from it.
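To make "recognize a PDF file from its contents" concrete, here is a minimal JDK-only sketch of the underlying idea: a well-formed PDF starts with the ASCII signature `%PDF-`, so a file can be classified by its first bytes regardless of its name. This is not Tika's API. The class name `PdfSniffer` and its methods are hypothetical, invented for illustration, and Tika's own detection is far more thorough (many formats, more lenient matching). It assumes Java 11 or later.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper (not part of Tika): checks for the PDF file signature.
final class PdfSniffer {
    // A well-formed PDF begins with the ASCII bytes "%PDF-".
    private static final byte[] MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    private PdfSniffer() { }

    /** Returns true if the given bytes start with the PDF signature. */
    static boolean looksLikePdf(byte[] head) {
        if (head == null || head.length < MAGIC.length) {
            return false;
        }
        for (int i = 0; i < MAGIC.length; i++) {
            if (head[i] != MAGIC[i]) {
                return false;
            }
        }
        return true;
    }

    /** Reads only the first few bytes of the file and checks the signature. */
    static boolean looksLikePdf(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            // readNBytes returns fewer bytes if the file is shorter than the signature.
            return looksLikePdf(in.readNBytes(MAGIC.length));
        }
    }
}
```

Calling `PdfSniffer.looksLikePdf(Path.of("report.dat"))` answers by content alone, so a PDF saved under the wrong extension is still recognized, while a text file renamed to `.pdf` is not.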