Conversion of PDF documents into HTML: a case study of document image analysis | IEEE Conference Publication | IEEE Xplore