282 lines
7.9 KiB
Markdown
282 lines
7.9 KiB
Markdown
# User Guide
|
|
|
|
A comprehensive guide to using Readur's features for document management, OCR processing, and search.
|
|
|
|
## Table of Contents
|
|
|
|
- [Getting Started](#getting-started)
|
|
- [Supported File Types](#supported-file-types)
|
|
- [Using the Interface](#using-the-interface)
|
|
- [Dashboard](#dashboard)
|
|
- [Document Management](#document-management)
|
|
- [Advanced Search](#advanced-search)
|
|
- [Folder Watching](#folder-watching)
|
|
- [Document Upload](#document-upload)
|
|
- [OCR Processing](#ocr-processing)
|
|
- [Search Features](#search-features)
|
|
- [Tags and Organization](#tags-and-organization)
|
|
- [User Settings](#user-settings)
|
|
- [Tips for Best Results](#tips-for-best-results)
|
|
|
|
## Getting Started
|
|
|
|
1. **First Login**:
|
|
- Navigate to `http://localhost:8000` (or your configured URL)
|
|
- Use the default admin credentials (username: `admin`, password: `readur2024`)
|
|
- **Important**: Change the default password immediately
|
|
|
|
2. **Initial Setup**:
|
|
- Configure your user preferences
|
|
- Set OCR language if different from English
|
|
- Adjust search and display settings
|
|
|
|
3. **Quick Start**:
|
|
- Upload your first document using drag-and-drop or the upload button
|
|
- Wait for OCR processing to complete
|
|
- Search for content within your documents
|
|
|
|
## Supported File Types
|
|
|
|
| Type | Extensions | OCR Support | Notes |
|
|
|------|-----------|-------------|-------|
|
|
| **PDF** | `.pdf` | ✅ | Text extraction + OCR for scanned pages |
|
|
| **Images** | `.png`, `.jpg`, `.jpeg`, `.tiff`, `.bmp`, `.gif` | ✅ | Full OCR text extraction |
|
|
| **Text** | `.txt`, `.rtf` | ❌ | Direct text indexing |
|
|
| **Office** | `.doc`, `.docx` | ⚠️ | Limited support |
|
|
|
|
## Using the Interface
|
|
|
|
### Dashboard
|
|
|
|
The dashboard provides an overview of your document system:
|
|
|
|
- **Document Statistics**:
|
|
- Total documents in the system
|
|
- Storage usage breakdown
|
|
- OCR processing status
|
|
- Recent activity timeline
|
|
|
|
- **Quick Actions**:
|
|
- Upload new documents
|
|
- Quick search bar
|
|
- Access to recent documents
|
|
- System notifications
|
|
|
|
### Document Management
|
|
|
|
#### List/Grid View
|
|
- **List View**: Detailed document information in a table format
|
|
- **Grid View**: Visual thumbnails for quick browsing
|
|
- Toggle between views using the view selector in the top toolbar
|
|
|
|
#### Sorting Options
|
|
- Upload date (newest/oldest first)
|
|
- File name (A-Z/Z-A)
|
|
- File size (largest/smallest)
|
|
- Document type
|
|
- OCR status
|
|
|
|
#### Filtering
|
|
- By file type (PDF, images, text)
|
|
- By OCR status (completed, pending, failed)
|
|
- By date range
|
|
- By tags
|
|
- By source (uploaded, watched folder)
|
|
|
|
#### Bulk Actions
|
|
1. Select multiple documents using checkboxes
|
|
2. Available bulk actions:
|
|
- Delete selected documents
|
|
- Add/remove tags
|
|
- Export document list
|
|
- Reprocess OCR
|
|
|
|
### Advanced Search
|
|
|
|
Readur offers powerful search capabilities:
|
|
|
|
#### Full-Text Search
|
|
- Search within document content
|
|
- Automatic stemming and fuzzy matching
|
|
- Phrase search with quotes: `"exact phrase"`
|
|
- Exclude terms with minus: `-excluded`
|
|
|
|
#### Search Filters
|
|
- **Date Range**: Find documents from specific time periods
|
|
- **File Type**: Limit search to specific formats
|
|
- **File Size**: Filter by document size
|
|
- **OCR Status**: Only search processed documents
|
|
- **Tags**: Search within tagged documents
|
|
|
|
#### Search Syntax
|
|
```
|
|
invoice 2024 # Find documents with both terms
|
|
"quarterly report" # Exact phrase search
|
|
invoice -draft # Exclude drafts
|
|
tag:important invoice # Search within tagged documents
|
|
type:pdf contract # Search only PDFs
|
|
```
|
|
|
|
### Folder Watching
|
|
|
|
The folder watching feature automatically imports documents:
|
|
|
|
1. **Non-destructive**: Source files remain untouched
|
|
2. **Automatic Processing**: New files are detected and processed
|
|
3. **Configurable Intervals**: Adjust scan frequency
|
|
4. **Multiple Sources**: Watch local folders, network drives, cloud storage
|
|
|
|
#### Setting Up Watch Folders
|
|
1. Go to Settings → Sources
|
|
2. Add a new source with type "Local Folder"
|
|
3. Configure the path and scan interval
|
|
4. Enable/disable the source as needed
|
|
|
|
## Document Upload
|
|
|
|
### Manual Upload
|
|
1. Click the upload button or drag files to the upload area
|
|
2. Select one or multiple files
|
|
3. Add tags during upload (optional)
|
|
4. Click "Upload" to start processing
|
|
|
|
### Drag and Drop
|
|
- Drag files directly from your file manager
|
|
- Drop anywhere on the document list page
|
|
- Multiple files can be dropped at once
|
|
|
|
### Upload Limits
|
|
- Maximum file size: Configurable (default 50MB)
|
|
- Supported formats: See [Supported File Types](#supported-file-types)
|
|
- Batch upload: Up to 100 files at once
|
|
|
|
## OCR Processing
|
|
|
|
### Automatic OCR
|
|
- Starts automatically after upload
|
|
- Processes documents in background
|
|
- Priority queue for smaller files
|
|
|
|
### OCR Settings
|
|
- **Language**: Select from 100+ languages
|
|
- **Preprocessing**: Enable image enhancement
|
|
- **Auto-rotation**: Correct document orientation
|
|
- **Quality**: Balance between speed and accuracy
|
|
|
|
### OCR Status Indicators
|
|
- 🟢 **Completed**: Full text extracted
|
|
- 🟡 **Processing**: OCR in progress
|
|
- 🔴 **Failed**: Error during processing
|
|
- ⚪ **Pending**: Waiting in queue
|
|
|
|
## Search Features
|
|
|
|
### Quick Search
|
|
- Available in the header on all pages
|
|
- Instant results as you type
|
|
- Shows top 5 matches with snippets
|
|
|
|
### Advanced Search Page
|
|
- Full search interface with all filters
|
|
- Export search results
|
|
- Save frequently used searches
|
|
- Search history
|
|
|
|
### Search Tips
|
|
1. Use quotes for exact phrases
|
|
2. Combine filters for precise results
|
|
3. Use wildcards: `inv*` matches invoice, inventory
|
|
4. Search in specific fields: `filename:report`
|
|
|
|
## Tags and Organization
|
|
|
|
### Creating Tags
|
|
1. Select document(s)
|
|
2. Click "Add Tag"
|
|
3. Enter tag name or select existing
|
|
4. Tags are color-coded for easy identification
|
|
|
|
### Tag Management
|
|
- Rename tags globally
|
|
- Merge similar tags
|
|
- Delete unused tags
|
|
- Set tag colors
|
|
|
|
### Smart Collections
|
|
Create saved searches based on:
|
|
- Tag combinations
|
|
- Date ranges
|
|
- File types
|
|
- Custom criteria
|
|
|
|
## User Settings
|
|
|
|
### Personal Preferences
|
|
- **Display**: List/grid default view
|
|
- **Language**: Interface language
|
|
- **Time Zone**: For accurate timestamps
|
|
- **Notifications**: Email/in-app alerts
|
|
|
|
### OCR Preferences
|
|
- Default OCR language
|
|
- Processing priority
|
|
- Image preprocessing options
|
|
- Batch size limits
|
|
|
|
### Search Settings
|
|
- Results per page
|
|
- Default sort order
|
|
- Snippet length
|
|
- Fuzzy search threshold
|
|
|
|
## Tips for Best Results
|
|
|
|
### OCR Quality
|
|
1. **Higher Resolution**: 300+ DPI produces better OCR results
|
|
2. **Clean Scans**: Avoid skewed or dirty documents
|
|
3. **Good Lighting**: For photo captures, ensure even lighting
|
|
4. **Text Contrast**: Black text on white background works best
|
|
|
|
### File Organization
|
|
1. **Consistent Naming**: Use descriptive, consistent file names
|
|
2. **Regular Uploads**: Don't let documents pile up
|
|
3. **Use Tags**: Tag documents immediately after upload
|
|
4. **Folder Structure**: Organize watch folders logically
|
|
|
|
### Search Optimization
|
|
1. **Use Filters**: Combine text search with filters
|
|
2. **Save Searches**: Save frequently used search queries
|
|
3. **Learn Syntax**: Master search operators for better results
|
|
4. **Index Regularly**: Ensure all documents are processed
|
|
|
|
### Performance Tips
|
|
1. **Batch Processing**: Upload similar documents together
|
|
2. **Off-Peak Hours**: Schedule large uploads during low-usage times
|
|
3. **Monitor Queue**: Check OCR queue status regularly
|
|
4. **Clean Up**: Remove outdated documents periodically
|
|
|
|
## Troubleshooting
|
|
|
|
### Common Issues
|
|
|
|
**OCR Not Starting**
|
|
- Check file size limits
|
|
- Verify supported file format
|
|
- Ensure OCR service is running
|
|
|
|
**Search Not Finding Documents**
|
|
- Confirm OCR completed successfully
|
|
- Check search syntax
|
|
- Try broader search terms
|
|
|
|
**Slow Performance**
|
|
- Review concurrent OCR job settings
|
|
- Check system resources
|
|
- Consider increasing memory limits
|
|
|
|
## Next Steps
|
|
|
|
- Explore the [API Reference](api-reference.md) for automation
|
|
- Learn about [advanced configuration](configuration.md)
|
|
- Set up [automated workflows](WATCH_FOLDER.md)
|
|
- Optimize [OCR performance](dev/OCR_OPTIMIZATION_GUIDE.md) |