perf3ct
fdc240fa5b
feat(webdav): track directory etags
...
✅ Core Optimizations Implemented
1. 📊 New Database Schema: Added webdav_directories table to track
directory ETags, file counts, and metadata
2. 🔍 Smart Directory Checking: Before deep scans, check directory
ETags with lightweight Depth: 0 PROPFIND requests
3. ΓÜí Skip Unchanged Directories: If directory ETag matches, skip the
entire deep scan
4. 🗂️ N-Depth Subdirectory Tracking: Recursively track all
subdirectories found during scans
5. 🎯 Individual Subdirectory Checks: When parent unchanged, check
each known subdirectory individually
🚀 Performance Benefits
Before: Every sync = Full Depth: infinity scan of entire directory
treeAfter:
- First sync: Full scan + directory tracking setup
- Subsequent syncs: Quick ETag checks → skip unchanged directories
entirely
- Changed directories: Only scan the specific changed subdirectories
📁 How It Works
1. Initial Request: PROPFIND Depth: 0 on /Documents → get directory
ETag
2. Database Check: Compare with stored ETag for /Documents
3. If Unchanged: Check each known subdirectory (/Documents/2024,
/Documents/Archive) individually
4. If Changed: Full recursive scan + update all directory tracking
data
2025-07-01 21:22:16 +00:00
Jon Fuller
1e655be542
Merge pull request #82 from readur/feat/merge-document-management-tabs
...
feat(client): combine the low confidence ocr tab with the bulk cleanup…
2025-07-01 13:55:10 -07:00
perf3ct
a2ea446e16
feat(client): update swagger ui endpoints
2025-07-01 20:54:45 +00:00
perf3ct
d2d7adb0ad
feat(client): update swagger ui endpoints
2025-07-01 19:00:29 +00:00
perf3ct
680cf5f526
feat(client): combine the low confience ocr tab with the bulk cleanup tab, and move debug menu
2025-07-01 18:52:28 +00:00
perf3ct
a8a6047214
fix(client): try to fix the lint issues on the debug page
2025-07-01 02:25:24 +00:00
Jon Fuller
1695692729
Merge pull request #81 from readur/feat/debug-page
...
feat(debug/pdf): debug page actually works, add ocrmypdf
2025-06-30 18:09:59 -07:00
perf3ct
8deca105b1
fix(client): try to fix the lint issues on the debug page
2025-07-01 01:05:40 +00:00
perf3ct
f7018575d8
feat(pdf): implement ocrmypdf to extract text from PDFs
2025-07-01 00:56:48 +00:00
Jon Fuller
59e80a1b92
Merge branch 'main' into feat/debug-page
2025-06-30 17:19:31 -07:00
perf3ct
2a1eeeda50
feat(debug): debug page actually works and does something
2025-07-01 00:15:48 +00:00
perf3ct
f26ab1e367
fix(pdf): resolve PDF wordcount error
2025-07-01 00:10:49 +00:00
Jon Fuller
799fd88f41
Merge pull request #79 from readur/renovate/playwright-monorepo
...
chore(deps): update dependency @playwright/test to v1.53.2
2025-06-30 16:31:26 -07:00
Jon Fuller
c8fa0b2ac7
Merge pull request #80 from readur/renovate/reqwest-0.x-lockfile
...
fix(deps): update rust crate reqwest to v0.12.21
2025-06-30 16:31:16 -07:00
Jon Fuller
75e6594f1a
Merge pull request #76 from readur/feat/wrangle-ignored-files
...
feat(server/client): wrangle ignored files and failed files
2025-06-30 16:31:00 -07:00
perf3ct
dd90e48fd2
feat(server): mark documents with 0 words as failed, and fix webdav unit tests
2025-06-30 22:43:25 +00:00
perf3ct
bf073132a1
fix(tests): fix documents tests
2025-06-30 21:56:21 +00:00
perf3ct
72781aa7b9
fix(tests): update frontend tests for new FailedDocument view
2025-06-30 21:20:34 +00:00
renovate[bot]
3fca544011
fix(deps): update rust crate reqwest to v0.12.21
2025-06-30 20:13:56 +00:00
Jon Fuller
ccab01f489
Create codeql.yml
2025-06-30 13:12:28 -07:00
perf3ct
b344b69da7
feat(server): fix serialization issues
2025-06-30 19:40:05 +00:00
perf3ct
d9b695f0bd
feat(server/client): add metadata to file view
2025-06-30 19:13:16 +00:00
renovate[bot]
4d0af80971
chore(deps): update dependency @playwright/test to v1.53.2
2025-06-30 18:56:41 +00:00
perf3ct
bf98aaab85
feat(client): fix faileddocumentviewer
2025-06-30 00:59:34 +00:00
perf3ct
5f10a8b82c
feat(server): continue to try to wrangle the failed and ignored documents
2025-06-29 23:27:51 +00:00
perf3ct
879425c192
feat(client): rename FailedOcrPage to DocumentManagementPage
2025-06-29 21:48:44 +00:00
perf3ct
cf8b9444ee
feat(client): remove demo statement from frontend
2025-06-29 21:44:11 +00:00
Jon Fuller
929ea96e6f
Merge pull request #73 from readur/feat/better-failure-handling
...
feat(server/client): add failed_documents table to handle failures, a…
2025-06-28 23:03:23 -07:00
perf3ct
ed3acc0b88
fix(tests): fix ocr endpoint tests due to our endpoint change
2025-06-29 05:35:49 +00:00
perf3ct
68fb776f7b
fix(tests): fix ocr endpoint tests due to our endpoint change
2025-06-28 23:43:04 +00:00
perf3ct
fa7fd86622
fix(tests): fix ocr endpoint tests due to our endpoint change
2025-06-28 23:11:47 +00:00
perf3ct
2797cb4367
fix(tests): fix failing document integration test, again
2025-06-28 22:58:20 +00:00
perf3ct
660117ed9f
fix(tests): fix failing integration tests, again
2025-06-28 22:54:41 +00:00
perf3ct
28a7e4eb45
fix(tests): resolve a whole lot of test issues
2025-06-28 22:50:40 +00:00
perf3ct
2b29032a42
fix(server): resolve compilation errors in constraint_validation.rs
2025-06-28 22:04:01 +00:00
perf3ct
df254d59e3
feat(server/client): resolve failing tests
2025-06-28 21:21:05 +00:00
perf3ct
34bc207e39
feat(server/client): add failed_documents table to handle failures, and move logic of failures
2025-06-28 20:52:58 +00:00
Jon Fuller
881e4c5a8e
Merge pull request #72 from readur/feat/better-db-tests
...
feat(tests): add regression tests and better sql type safety tests
2025-06-28 12:43:52 -07:00
perf3ct
25d6ecce6c
feat(tests): add regression tests and better sql type safety tests
2025-06-28 19:25:15 +00:00
perf3ct
e2633d7162
feat(swagger): add missing oidc endpoints into swagger ui
2025-06-28 19:19:48 +00:00
perf3ct
a792d2e6fd
fix(server): resolve incorrect db type
2025-06-28 18:41:48 +00:00
perf3ct
d18daa2c17
fix(server): resolve compilation issues from queue.rs
2025-06-28 18:15:55 +00:00
perf3ct
6dd580fa2f
fix(ocr_status): populate the ocr queue with pending jobs and add easy 'retry' button
2025-06-28 18:08:00 +00:00
perf3ct
ac31573db5
feat(migration): disable OCR consistency trigger for OCR confidence backfill
2025-06-28 17:23:35 +00:00
Jon Fuller
a314f64ca9
Merge pull request #69 from readur/fix/ocr-confidence-1
...
fix(server/client): fix incorrect OCR measurements
2025-06-28 09:53:56 -07:00
perf3ct
7247f74456
feat(tests): create generic migration tests
2025-06-28 16:38:12 +00:00
perf3ct
dc5cb99462
feat(migration): instead of hardcoded guessing, re-enter those documents into the queue
2025-06-28 14:53:45 +00:00
perf3ct
3ad4bbf456
fix(migrations): resolve issue in migration for ocr confidence
2025-06-28 14:51:06 +00:00
Jon Fuller
7c7749338d
Merge pull request #71 from readur/feat/view-failed-document
...
feat(frontend): view failed document and link to details
2025-06-28 07:37:51 -07:00
aaldebs99
83217e508e
feat(frontend): view failed document and link to details
2025-06-28 05:10:38 +00:00