ZeroxPDFLoader
Overview
ZeroxPDFLoader is a document loader built on the Zerox library. Zerox converts the pages of a PDF into images, passes them to a vision-capable language model, and generates a structured Markdown representation. Zerox itself runs asynchronously, but the loader exposes it through the standard synchronous loader interface and returns one document per page.
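A minimal usage sketch follows. It assumes the optional `py-zerox` dependency is installed and that the model is an OpenAI-hosted one authenticated through the `OPENAI_API_KEY` environment variable. The helper name `load_pdf_pages` and the file name in the usage note are hypothetical.

```python
import os
from typing import List

# Assumption: an OpenAI-hosted vision model; other providers need other credentials.
MODEL = "gpt-4o-mini"


def load_pdf_pages(file_path: str) -> List:
    """Return one Document per PDF page, each holding Markdown text."""
    # Fail early with a clear message instead of deep inside the model call.
    if "OPENAI_API_KEY" not in os.environ:
        raise RuntimeError("Set OPENAI_API_KEY before calling the vision model.")

    # Imported lazily so that defining this helper does not require the
    # optional langchain_community / py-zerox dependencies.
    from langchain_community.document_loaders.pdf import ZeroxPDFLoader

    loader = ZeroxPDFLoader(file_path, model=MODEL)
    return loader.load()
```

Calling `load_pdf_pages("report.pdf")` should then give a list whose items expose `page_content` (the generated Markdown) and `metadata`. Every page costs a model call, so large PDFs can be slow and expensive to process.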
Integration details
| Class | Package | Local | Serializable | JS support |
|---|---|---|---|---|
| ZeroxPDFLoader | langchain_community | ❌ | ❌ | ❌ |
Loader features
| Source | Document Lazy Loading | Native Async Support |
|---|---|---|
| ZeroxPDFLoader | ✅ | ❌ |
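The combination in the table (lazy loading without native async support) can be illustrated with a small, self-contained sketch: a synchronous generator that drives an asynchronous converter to completion and then yields one record per page. This is an illustration of the general pattern, not the loader's actual source. `PageDoc`, `fake_convert`, and `lazy_load_pages` are hypothetical stand-ins.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Dict, Iterator, List


@dataclass
class PageDoc:
    """Hypothetical stand-in for a LangChain Document."""
    page_content: str
    metadata: Dict[str, object] = field(default_factory=dict)


async def fake_convert(file_path: str) -> List[str]:
    """Hypothetical async converter standing in for the real Zerox call."""
    await asyncio.sleep(0)  # placeholder for image rendering and model requests
    return [f"# Page {n} of {file_path}" for n in (1, 2, 3)]


def lazy_load_pages(file_path: str) -> Iterator[PageDoc]:
    """Synchronous generator that wraps the asynchronous converter."""
    # The whole conversion runs on the first next() call; after that,
    # pages are handed out one at a time.
    pages = asyncio.run(fake_convert(file_path))
    for number, markdown in enumerate(pages, start=1):
        yield PageDoc(markdown, {"source": file_path, "page": number})
```

Because the generator calls `asyncio.run`, a wrapper of this shape cannot be iterated from inside an already running event loop on the same thread. Callers in async code would need to run it in a worker thread.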