We're currently building and refining our Knowledge Center. You may encounter layout issues, styling inconsistencies, or minor technical quirks as we finalize the platform.

PDF Connector

Prev Next
Note on third‑party tools

We aim to provide accurate and helpful details about third‑party tools, but we can’t guarantee that this information is always complete or up to date. If you notice any discrepancies, feel free to share them in the feedback section below. For the most reliable information, please always refer to the third‑party tool’s official documentation.

PDF (Portable Document Format) is a file format developed by Adobe for presenting documents independently of software, hardware, or operating systems.
The pdf component allows finding text in PDF documents, listing page numbers, and extracting specific pages from a document.

Actions

Extract All Text

Extracts all text from the specified PDF document and returns it as an array of text strings.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.

Extract Page

Extracts the specified page from the PDF document and returns it as a new separate PDF document.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.
Page Number The page number to extract from the PDF.

Extract Page Text

Extracts text from the specified page range in the PDF document.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.
Page Start The starting page number for extraction.
Page End The ending page number for extraction. If not provided, only the start page is extracted.

Extract Text by Pattern

Extracts text from the specified PDF document that matches the search text.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.
Search Pattern This is the text to search for in the PDF document.
Characters After The number of characters to extract after the search pattern. If not provided, the entire page is returned.
Case Sensitive When true, the search is case-sensitive. false

Find Pattern

Searches the PDF document and returns page numbers containing text that matches the search criteria.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.
Search Pattern The text pattern to search for in the PDF document.
Case Sensitive When true, the search is case-sensitive. false
Use Regex When true, treats the search pattern as a regular expression. false
Contains When true, returns pages containing the pattern; when false, returns pages without the pattern. true

Page Numbers

Returns a sequence of page numbers for the PDF document, from 1 to the last page.

Input Comments Default
PDF Data The PDF file data to process. This can be a file reference from a previous step.