Hi,
I need some suggestions on how to proceed with the scenario below.
I have some data in a PDF, in tabular format:
Date CustomerID Phone number ListingName
12/12/12 12345678 0123645697 ABC; XYZ 123, XYZ road.
Rules
The rules below need to be applied, and the matching information needs to be fetched.
1. Check whether the values under ListingName are in alphabetical order.
2. If the customer ID starts with 5 and the listing name is repeated more than once, mark those rows.
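To make the two rules concrete, here is a minimal sketch of how they might look in plain Java once the rows are in memory. The class and method names (ListingRules, Row, isAlphabetical, rowsToMark) are hypothetical. It also assumes that the alphabetical check ignores case and that "repeated" means the listing name occurs more than once across all rows, not only among customer IDs starting with 5.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class ListingRules {

    /** One table row; only the two columns the rules need (names are assumptions). */
    static final class Row {
        final String customerId;
        final String listingName;

        Row(String customerId, String listingName) {
            this.customerId = customerId;
            this.listingName = listingName;
        }
    }

    /** Rule 1: true if no listing name sorts before the one preceding it (case-insensitive). */
    static boolean isAlphabetical(List<Row> rows) {
        for (int i = 1; i < rows.size(); i++) {
            String previous = rows.get(i - 1).listingName;
            String current = rows.get(i).listingName;
            if (previous.compareToIgnoreCase(current) > 0) {
                return false;
            }
        }
        return true;
    }

    /** Rule 2: indexes of rows whose customer ID starts with 5 and whose listing name occurs more than once. */
    static List<Integer> rowsToMark(List<Row> rows) {
        // First pass: count how often each listing name occurs over all rows.
        Map<String, Integer> counts = new HashMap<>();
        for (Row row : rows) {
            counts.merge(row.listingName, 1, Integer::sum);
        }
        // Second pass: collect the rows that satisfy both conditions.
        List<Integer> marked = new ArrayList<>();
        for (int i = 0; i < rows.size(); i++) {
            Row row = rows.get(i);
            if (row.customerId.startsWith("5") && counts.get(row.listingName) > 1) {
                marked.add(i);
            }
        }
        return marked;
    }
}
```

Returning row indexes rather than changing anything in place keeps the rule logic separate from however the marking ends up being done (PDF or Excel).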
Queries
1. The PDF is around 3000 pages. Can it be read using iText? I wanted to know whether the size of the PDF is a problem when reading it with Java.
2. Will we be able to mark the values in the PDF using Java?
3. I was planning to read the PDF using iText and apply a regex pattern to implement the rules. But where and how do I store all the data?
4. Is it possible to store all the data in Excel and apply the rules there? Is it possible to do the marking in an Excel file using Java?
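For the regex and storage part, this is roughly what I have in mind: a sketch only, assuming the PDF text has already been extracted into one string per table row (the extraction step is not shown). RowParser and its methods are hypothetical names, and the pattern assumes every data row looks like the sample above: a dd/dd/dd date, a numeric customer ID, a numeric phone number, then the listing name as the rest of the line. The rows are simply kept in a List in memory.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class RowParser {

    // Assumed row layout: date, customer ID, phone number, then free-text listing name.
    private static final Pattern ROW =
            Pattern.compile("^(\\d{2}/\\d{2}/\\d{2})\\s+(\\d+)\\s+(\\d+)\\s+(.+)$");

    /** Returns {date, customerId, phoneNumber, listingName}, or null if the line is not a data row. */
    static String[] parseLine(String line) {
        Matcher matcher = ROW.matcher(line.trim());
        if (!matcher.matches()) {
            return null; // header lines, blank lines, page footers, etc.
        }
        return new String[] {
            matcher.group(1), matcher.group(2), matcher.group(3), matcher.group(4)
        };
    }

    /** Parses every line and keeps only the ones that look like data rows. */
    static List<String[]> parseAll(List<String> lines) {
        List<String[]> rows = new ArrayList<>();
        for (String line : lines) {
            String[] row = parseLine(line);
            if (row != null) {
                rows.add(row);
            }
        }
        return rows;
    }

    public static void main(String[] args) {
        String[] row = parseLine("12/12/12 12345678 0123645697 ABC; XYZ 123, XYZ road.");
        System.out.println(row[1] + " | " + row[3]);
    }
}
```

A listing name that wraps onto a second line in the PDF would not match this pattern, so that case would need separate handling.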
Thanks in advance for the help.
Please let me know if the requirement needs more clarification.