Replies: 1 comment
-
maybe the hocr would work |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
We've been using PdfPig to process PDF forms and, until just recently, had great success extracting text from documents. We've run into documents that contain pages, but no letters and result in no text extraction. The operations of our successful extractions contain a series of Text operations (e.g. TextObjects.BeginText, TextObjects.SetFontAndSize, TextObjects.ShowText, etc...), but these new documents contain PathConstruction operations (e.g. PathConstruciton.AppendStraightLineSegment, PathConstruction.BeginNewSubpath, etc...). Is there a way for us to extract the text from these new documents?
Beta Was this translation helpful? Give feedback.
All reactions