PDF Text Extractor avatar
PDF Text Extractor

Pricing

Pay per usage

Go to Store
PDF Text Extractor

PDF Text Extractor

Developed by

Jiří Moravčík

Jiří Moravčík

Maintained by Community

PDF Text Extractor allows you to extract text from PDF files. It also supports chunking of the text to prepare the data for usage with large language models.

5.0 (1)

Pricing

Pay per usage

41

Total users

726

Monthly users

67

Runs succeeded

>99%

Issues response

22 hours

Last modified

2 months ago

ND

Output data is redacted

Open

andideng opened this issue
3 days ago

Hi, a lot of the data I extract from PDF are redacted. Is there a way to get around this?

jirimoravcik avatar

Hello, can you be more specific please, e.g. provide some examples that aren't in the extracted data? It's possible that the format of the PDF is just difficult to parse and the internal library struggles with that - there's sadly no way around that.