For form components like checkboxes, you need to use the pre-processor PdfFormToXML_xx .
the attached image contains two attachment.
one of which is the original pdf (image 1)containing a table format.While I am trying to extract the data from an individual cell(multi-line) using the content (after parsing using pdftoText4). It seems to select the data of the adjoining column. I want to extract the data for individual column . for e.g. the company name tag should contain data from column next to it. and so on for other column too. I have used the table formatting as well but no help. The same can be seen in second attachment(image 2) how the text is being selected in the data processor transformation.informatica client.
If you do not have form components for checkboxes, you need to set encoding as UTF-8 and use the pre-processor PdfToTxt_x .