Document Parser
v1.2.3Document Parser is an AI agent skill in the Data & Files category. Parse and extract structured data from PDFs, Word docs, spreadsheets, and images with OCR.
npx agentmag add doc-parserAbout
The Document Parser skill handles the messy reality of business documents. It can parse PDFs (including scanned ones via OCR), Word documents, Excel spreadsheets, PowerPoint presentations, and images — extracting text, tables, and metadata into clean, structured formats.
Goes beyond simple text extraction: it understands document layout, preserves table structure, handles multi-column text, and can extract specific fields from invoices, contracts, and forms.
Capabilities
- PDF parsing with layout-aware text extraction
- OCR for scanned documents and images
- Table extraction preserving row/column structure
- Word, Excel, PowerPoint, and CSV support
- Invoice and form field extraction
- Batch processing for document collections
Compatible agents
Add Document Parser to your agent
One command to install. Works with all major coding agents.
Related AI agent resources
Building an agent skill? Submit your AI agent skill for free or get featured placement.