The CreatorCon Call for Content is officially open! Get started here.

Turn a PDF into Text

Edward Halliday
Mega Guru

I am looking to turn a pdf into text in ServiceNow. 

 

I have looked into document intelligence but it appear to be for pulling specific fields out of a PDF. I just want all the text in the PDF provided in a string. 

 

Does anyone know how to do this? Is there some sort of PDF parser utility? 

6 REPLIES 6

cagg
Tera Guru

Hi Edward - did you ever find a solution? I have the same ask right now.

Thanks, Jim

If you are licensed for it, you could leverage document intelligence to extract values: https://www.servicenow.com/docs/csh?topicname=doc-intel-workspace.html&version=latest

Thanks jonsan09 - let me have a look and circle back.

Thanks for the quick response!

Hi @jonsan09 , you've brought up a very good point. But a key limitation of Document Intelligence is that it only processes the first 25 pages of a document. If you need to extract data from a document longer than that, we may need to consider an alternative approach.

Additionally, we've found that Document Intelligence can have issues with extracting long text into a single field, and there's also a limit on the number of keys we can define for extraction.