4.6 KiB
Readur Documentation
Intelligent Document Management with OCR
Readur is a powerful document management system that transforms your unstructured documents into a searchable, organized knowledge base. Built for teams and individuals who need to efficiently manage, search, and access large document collections.
Key Capabilities
-
Getting Started
Deploy Readur with Docker in minutes - single command setup -
User Guide
Master document management, OCR processing, and search features -
API Reference
Automate workflows with REST API integration -
Deployment Guide
Production deployment with SSL, monitoring, and scaling
Core Features
Document Processing
- Automatic OCR: Extract text from scanned PDFs and images in 100+ languages
- Multiple Formats: Support for PDF, PNG, JPG, TIFF, TXT, and Office documents
- Batch Processing: Upload and process hundreds of documents simultaneously
- Smart Queue: Priority-based OCR processing with configurable concurrency
Search & Discovery
- Full-Text Search: Find content within documents instantly
- Advanced Filtering: Search by date, type, size, labels, and OCR status
- Boolean Operators: Complex queries with AND, OR, NOT logic
- Fuzzy Matching: Handle typos and OCR errors automatically
Organization
- Flexible Labels: Create custom taxonomies for document categorization
- Bulk Operations: Apply changes to multiple documents at once
- Smart Collections: Saved searches that update automatically
- Multiple Views: List and grid layouts with customizable sorting
Integration & Automation
- Source Synchronization: Auto-import from WebDAV, S3, and local folders
- REST API: Complete programmatic access for automation
- Watch Folders: Monitor directories for automatic document ingestion
- SSO Support: Enterprise authentication with OIDC/OAuth2
Quick Start Example
# 1. Clone the repository
git clone https://github.com/readur/readur.git
cd readur
# 2. Start with Docker Compose
docker-compose up -d
# 3. Access the interface
open http://localhost:8000
# Default credentials: admin / readur2024
Common Use Cases
Digital Archives
Convert paper documents into searchable digital archives. Perfect for:
- Legal firms managing contracts and case files
- Medical practices digitizing patient records
- Government agencies preserving historical documents
- Libraries and research institutions
Business Document Management
Streamline document workflows and compliance:
- Invoice and receipt processing
- Contract management and search
- Policy and procedure documentation
- Compliance document tracking
Personal Knowledge Base
Organize personal documents and research:
- Academic papers and research notes
- Tax documents and financial records
- Technical documentation and manuals
- Personal correspondence and archives
System Architecture
graph LR
A[Document Upload] --> B[OCR Queue]
B --> C[Text Extraction]
C --> D[PostgreSQL Database]
D --> E[Full-Text Search]
F[Source Sync] --> B
G[Watch Folders] --> B
H[API] --> D
Performance & Scalability
- Concurrent Processing: Handle multiple OCR jobs in parallel
- Efficient Storage: S3-compatible backend for unlimited scaling
- PostgreSQL: Enterprise-grade database with full-text search
- Docker Native: Container-based architecture for easy deployment
- Resource Management: Configurable memory and CPU limits
Getting Help
Documentation
- User Guide - Complete feature documentation
- Configuration Reference - All environment variables
- Troubleshooting - Common issues and solutions
- Migration Guide - Upgrade and migration procedures
Community & Support
- GitHub Issues - Bug reports and feature requests
- GitHub Discussions - Community help
- Developer Documentation - Architecture and development setup
Latest Updates
Version 2.5.4
- S3 storage backend support for unlimited scaling
- Enhanced source synchronization with health monitoring
- Improved OCR performance and accuracy
- OIDC/SSO authentication support
- Advanced search with boolean operators
!!! tip "Production Ready" Readur is actively used in production environments processing millions of documents. See our Deployment Guide for best practices.