DocuClaw
// Sovereign document intelligence & archival system. Extracts structured data from invoices, receipts, and contracts 100% locally using AI.
$ git log --oneline --stat
stars:1,933
forks:367
updated:March 4, 2026
SKILL.mdreadonly
SKILL.md Frontmatter
nameDocuClaw
descriptionSovereign document intelligence & archival system. Extracts structured data from invoices, receipts, and contracts 100% locally using AI.
DocuClaw Skill
DocuClaw provides a sovereign data infrastructure for processing and archiving documents. It uses multimodal LLMs to extract structured information from scans, photos, and emails, storing everything in human-readable, version-controllable Markdown files.
Use Cases
- Expense Management: Extract totals, taxes, and dates from receipts for tax filing.
- Contract Analysis: Monitor expiration dates and renewal clauses in legal documents.
- Sovereign Archival: Maintain a local-first, GDPR/GoBD compliant archive of all physical and digital mail.
- Unified Querying: Ask questions about your document history without cloud exposure.
Key Features
- 100% Local: Zero cloud dependency. Your private data never leaves your hardware.
- Plug-and-Play Parsers: Extensible architecture for country-specific document formats.
- AI-Powered: Supports Ollama, OpenAI Vision, or any multimodal model for intelligent extraction.
- Markdown Schema: Normalizes all documents into a universal schema with YAML metadata.
Workflow Example
- Input: A PDF invoice or a photo of a receipt.
- Process: Run
docuclaw processto trigger AI extraction. - Archive: Document is saved to your local vault as
YYYY/MM/filename.md. - Action: The extracted data is synced to your calendar or accounting tool.
Integration
DocuClaw is designed to work seamlessly with the OpenClaw ecosystem, allowing AI agents to perform RAG (Retrieval-Augmented Generation) over your local document archive.