Read PDF file


 


In this section, you will learn how to read a PDF file.


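iText is a third-party Java library for reading and writing PDF files. Here we use its PdfReader class to read a PDF file. The content of the first page is read as bytes and split into tokens by PRTokeniser. Each string token is appended to a StringBuffer, which is then converted back into a String and displayed on the command prompt. This prints the raw string operands of the page, so it works best on simple PDFs; text drawn with a custom font encoding may not come out readable.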

Here is the code:

import java.io.IOException;
import com.lowagie.text.pdf.PRTokeniser;
import com.lowagie.text.pdf.PdfReader;

public class ReadPDF {
	public static void main(String[] args) {
		try {
			// Open the PDF; "file.pdf" is resolved against the working directory
			PdfReader reader = new PdfReader("file.pdf");
			// Decoded content stream of page 1 (also works when /Contents is an array)
			byte[] bytes = reader.getPageContent(1);
			reader.close();
			// Split the content stream into PDF tokens
			PRTokeniser tokenizer = new PRTokeniser(bytes);
			StringBuffer buffer = new StringBuffer();
			while (tokenizer.nextToken()) {
				// Keep only string tokens, which hold the text shown on the page
				if (tokenizer.getTokenType() == PRTokeniser.TK_STRING) {
					buffer.append(tokenizer.getStringValue());
				}
			}
			String test = buffer.toString();
			System.out.println(test);
		} catch (IOException e) {
			// Report the failure instead of silently ignoring it
			System.err.println("Could not read PDF: " + e.getMessage());
		}
	}
}
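
To see what the tokenizer step is doing, here is a minimal sketch in plain Java (no iText) that pulls the literal strings out of a page's content bytes. It is a simplified stand-in for PRTokeniser, not how iText is implemented. The class name ContentStreamStrings, the method extractLiteralStrings, and the sample content stream are all made up for illustration. It assumes the bytes are already decompressed, handles only strings written in parentheses, and keeps an escaped character as-is instead of decoding escapes such as \n or octal codes.

```java
import java.nio.charset.StandardCharsets;

public class ContentStreamStrings {

    // Hypothetical helper: collects the text of every literal string "( ... )"
    // found in an already-decoded PDF content stream.
    static String extractLiteralStrings(byte[] content) {
        StringBuilder out = new StringBuilder();
        int depth = 0; // parenthesis nesting level; 0 means we are outside a string
        for (int i = 0; i < content.length; i++) {
            char c = (char) (content[i] & 0xFF);
            if (depth == 0) {
                // Outside a string: skip operators and operands until "(" opens one
                if (c == '(') {
                    depth = 1;
                }
                continue;
            }
            if (c == '\\' && i + 1 < content.length) {
                // Simplification: copy the escaped character without decoding it
                out.append((char) (content[++i] & 0xFF));
            } else if (c == '(') {
                // Balanced parentheses may appear inside a literal string
                depth++;
                out.append(c);
            } else if (c == ')') {
                depth--;
                if (depth > 0) {
                    out.append(c);
                }
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Made-up content stream: two "show text" (Tj) operations
        byte[] content = "BT /F1 12 Tf (Hello) Tj ( World) Tj ET"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(extractLiteralStrings(content));
    }
}
```

In the ReadPDF program the bytes come from the PDF page itself, and PRTokeniser also recognises hexadecimal strings, numbers, names and operators, which this sketch ignores.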
