# Readur Documentation

## Intelligent Document Management with OCR

Readur is a powerful document management system that transforms your unstructured documents into a searchable, organized knowledge base. It is built for teams and individuals who need to manage, search, and access large document collections efficiently.

## Key Capabilities

<div class="grid cards" markdown>

- **[Getting Started](getting-started/installation.md)**
  Deploy Readur with Docker in minutes - single command setup

- **[User Guide](user-guide.md)**
  Master document management, OCR processing, and search features

- **[API Reference](api-reference.md)**
  Automate workflows with REST API integration

- **[Deployment Guide](deployment.md)**
  Production deployment with SSL, monitoring, and scaling

</div>

## Core Features

### Document Processing
- **Automatic OCR**: Extract text from scanned PDFs and images in 100+ languages
- **Multiple Formats**: Support for PDF, PNG, JPG, TIFF, TXT, and Office documents
- **Batch Processing**: Upload and process hundreds of documents simultaneously
- **Smart Queue**: Priority-based OCR processing with configurable concurrency

### Search & Discovery
- **Full-Text Search**: Find content within documents instantly
- **Advanced Filtering**: Search by date, type, size, labels, and OCR status
- **Boolean Operators**: Complex queries with AND, OR, NOT logic
- **Fuzzy Matching**: Handle typos and OCR errors automatically

### Organization
- **Flexible Labels**: Create custom taxonomies for document categorization
- **Bulk Operations**: Apply changes to multiple documents at once
- **Smart Collections**: Saved searches that update automatically
- **Multiple Views**: List and grid layouts with customizable sorting

### Integration & Automation
- **Source Synchronization**: Auto-import from WebDAV, S3, and local folders
- **REST API**: Complete programmatic access for automation
- **Watch Folders**: Monitor directories for automatic document ingestion
- **SSO Support**: Enterprise authentication with OIDC/OAuth2

## Quick Start Example

```bash
# 1. Clone the repository
git clone https://github.com/readur/readur.git
cd readur

# 2. Start with Docker Compose
#    (with Compose V2, the equivalent command is "docker compose up -d")
docker-compose up -d

# 3. Access the interface
#    ("open" is the macOS command; on Linux, use xdg-open or visit the URL in a browser)
open http://localhost:8000

# Default credentials: admin / readur2024
```
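
The containers may need a moment before the web interface answers, so opening the browser right after step 2 can fail. The following is a minimal sketch of a readiness check, assuming `curl` is installed and that the root URL returns a success status once startup has finished. `wait_for_http` is a hypothetical helper written for this example, not something Readur ships.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of Readur): poll a URL until it answers
# with a success status or the attempt budget is used up.
# Usage: wait_for_http URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_http() {
  wfh_url="$1"
  wfh_attempts="${2:-30}"   # assumed default: 30 attempts
  wfh_delay="${3:-2}"       # assumed default: 2 seconds between attempts
  wfh_i=1
  while [ "$wfh_i" -le "$wfh_attempts" ]; do
    # --fail turns HTTP errors (status >= 400) into a non-zero exit code;
    # --max-time caps each request at 5 seconds.
    if curl --silent --fail --output /dev/null --max-time 5 "$wfh_url"; then
      echo "ready after ${wfh_i} attempt(s)"
      return 0
    fi
    # Pause only if another attempt is still to come.
    if [ "$wfh_i" -lt "$wfh_attempts" ]; then
      sleep "$wfh_delay"
    fi
    wfh_i=$((wfh_i + 1))
  done
  echo "not reachable after ${wfh_attempts} attempt(s)" >&2
  return 1
}
```

With the function defined in the current shell, `wait_for_http http://localhost:8000 && open http://localhost:8000` opens the interface only once it responds. The attempt count and delay are arbitrary starting values; adjust them to how long the containers take to start on your machine.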

## Common Use Cases

### Digital Archives
Convert paper documents into searchable digital archives. Perfect for:

- Legal firms managing contracts and case files
- Medical practices digitizing patient records
- Government agencies preserving historical documents
- Libraries and research institutions

### Business Document Management
Streamline document workflows and compliance:

- Invoice and receipt processing
- Contract management and search
- Policy and procedure documentation
- Compliance document tracking

### Personal Knowledge Base
Organize personal documents and research:

- Academic papers and research notes
- Tax documents and financial records
- Technical documentation and manuals
- Personal correspondence and archives

## System Architecture

```mermaid
graph LR
    A[Document Upload] --> B[OCR Queue]
    B --> C[Text Extraction]
    C --> D[PostgreSQL Database]
    D --> E[Full-Text Search]
    F[Source Sync] --> B
    G[Watch Folders] --> B
    H[API] --> D
```

## Performance & Scalability

- **Concurrent Processing**: Handle multiple OCR jobs in parallel
- **Efficient Storage**: S3-compatible backend for unlimited scaling
- **PostgreSQL**: Enterprise-grade database with full-text search
- **Docker Native**: Container-based architecture for easy deployment
- **Resource Management**: Configurable memory and CPU limits

## Getting Help

### Documentation
- [User Guide](user-guide.md) - Complete feature documentation
- [Configuration Reference](configuration-reference.md) - All environment variables
- [Troubleshooting](s3-troubleshooting.md) - Common issues and solutions
- [Migration Guide](migration-guide.md) - Upgrade and migration procedures

### Community & Support
- [GitHub Issues](https://github.com/readur/readur/issues) - Bug reports and feature requests
- [GitHub Discussions](https://github.com/readur/readur/discussions) - Community help
- [Developer Documentation](dev/) - Architecture and development setup

## Latest Updates

### Version 2.5.4
- S3 storage backend support for unlimited scaling
- Enhanced source synchronization with health monitoring
- Improved OCR performance and accuracy
- OIDC/SSO authentication support
- Advanced search with boolean operators

---

!!! tip "Production Ready"
    Readur is actively used in production environments processing millions of documents. See our [Deployment Guide](deployment.md) for best practices.