What is the most efficient way of image document classification?

Asked Mar 06 '23 at 13:58

Active Mar 06 '23 at 14:01


So, I am working on a project where I have to extract the sales tax invoice from a PDF document that contains other files along with the invoice. I researched the topic and am considering two solutions:

1. Converting the PDF to images and then performing image classification with VGG-16 or a similar model.
2. Using a transformer model for document classification, which would convert each image/PDF page to text and then classify it.

Both solutions have latency issues: the first requires converting the PDF to images (pdf2image), which is slow, and the second relies on OCR, which is also slow. So I need some advice on how to approach this problem.
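
To make the second option more concrete, here is a rough sketch of the page-filtering step it implies. It rests on several assumptions: the text of each page is taken as already extracted (the OCR step is not shown), `INVOICE_KEYWORDS` and `keyword_score` are hypothetical names, and the keyword scorer is only a simple stand-in for the transformer classifier, not a real model. The sample pages are made up for illustration.

```python
from typing import Callable, List

# Hypothetical keyword list; a placeholder for a trained classifier.
INVOICE_KEYWORDS = ("sales tax invoice", "invoice no", "sales tax")


def keyword_score(page_text: str) -> float:
    """Return the fraction of the hypothetical keywords found in the page text."""
    text = page_text.lower()
    hits = sum(1 for kw in INVOICE_KEYWORDS if kw in text)
    return hits / len(INVOICE_KEYWORDS)


def find_invoice_pages(
    page_texts: List[str],
    score: Callable[[str], float] = keyword_score,
    threshold: float = 0.5,
) -> List[int]:
    """Return 0-based indices of pages whose score reaches the threshold."""
    return [i for i, text in enumerate(page_texts) if score(text) >= threshold]


# Made-up page texts standing in for the output of the text-extraction step.
pages = [
    "Delivery challan for goods dispatched",
    "SALES TAX INVOICE\nInvoice No: 123\nSales tax payable",
    "Bank statement for February",
]
invoice_pages = find_invoice_pages(pages)
print(invoice_pages)
```

The `score` argument is a plain callable, so a real model could be swapped in later without changing the filtering logic.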

edited Mar 06 '23 at 14:01

asked Mar 06 '23 at 13:58

Sardar Arslan

0 Answers