Commit Graph

692 Commits

Author SHA1 Message Date
perf3ct a381cdd12c feat(webdav): also fix the parser to include directories, and add tests 2025-07-01 22:03:06 +00:00
perf3ct c1dbd06df2 feat(tests): add unit tests for new webdav functionality 2025-07-01 21:39:31 +00:00
Jon Fuller c164a4666f Merge pull request #39 from readur/renovate/vite-7.x
chore(deps): update dependency vite to v7
2025-07-01 14:24:22 -07:00
Jon Fuller 92b4ba7829 Merge pull request #83 from readur/renovate/reqwest-0.x-lockfile
fix(deps): update rust crate reqwest to v0.12.22
2025-07-01 14:24:13 -07:00
Jon Fuller fc510af644 Merge pull request #84 from readur/renovate/aws-sdk-rust-monorepo
fix(deps): update rust crate aws-config to v1.8.1
2025-07-01 14:24:03 -07:00
perf3ct 92b21350db feat(webdav): track directory etags
✅ Core Optimizations Implemented

  1. 📊 New Database Schema: Added webdav_directories table to track
directory ETags, file counts, and metadata
  2. 🔍 Smart Directory Checking: Before deep scans, check directory
ETags with lightweight Depth: 0 PROPFIND requests
  3. ΓÜí Skip Unchanged Directories: If directory ETag matches, skip the
entire deep scan
  4. 🗂️ N-Depth Subdirectory Tracking: Recursively track all
subdirectories found during scans
  5. 🎯 Individual Subdirectory Checks: When parent unchanged, check
each known subdirectory individually

  🚀 Performance Benefits

  Before: Every sync = Full Depth: infinity scan of entire directory
treeAfter:
  - First sync: Full scan + directory tracking setup
  - Subsequent syncs: Quick ETag checks → skip unchanged directories
entirely
  - Changed directories: Only scan the specific changed subdirectories

  📁 How It Works

  1. Initial Request: PROPFIND Depth: 0 on /Documents → get directory
ETag
  2. Database Check: Compare with stored ETag for /Documents
  3. If Unchanged: Check each known subdirectory (/Documents/2024,
/Documents/Archive) individually
  4. If Changed: Full recursive scan + update all directory tracking
data
2025-07-01 21:22:16 +00:00
renovate[bot] ba1cdfc9b7 fix(deps): update rust crate aws-config to v1.8.1 2025-07-01 20:56:42 +00:00
Jon Fuller 36dcbaed98 Merge pull request #82 from readur/feat/merge-document-management-tabs
feat(client): combine the low confidence ocr tab with the bulk cleanup…
2025-07-01 13:55:10 -07:00
perf3ct 6a23a407bf feat(client): update swagger ui endpoints 2025-07-01 20:54:45 +00:00
perf3ct c18fde6fc1 feat(client): update swagger ui endpoints 2025-07-01 19:00:29 +00:00
renovate[bot] 13a2b39bcd fix(deps): update rust crate reqwest to v0.12.22 2025-07-01 18:55:12 +00:00
Jon Fuller 2e1a05fc8d Merge branch 'main' into feat/multiple-ocr-languages 2025-07-01 11:53:42 -07:00
perf3ct ee4841c027 feat(client): combine the low confience ocr tab with the bulk cleanup tab, and move debug menu 2025-07-01 18:52:28 +00:00
perf3ct 893ba331f5 fix(client): try to fix the lint issues on the debug page 2025-07-01 02:25:24 +00:00
Jon Fuller 83b046acc6 Merge pull request #81 from readur/feat/debug-page
feat(debug/pdf): debug page actually works, add ocrmypdf
2025-06-30 18:09:59 -07:00
perf3ct f22673bbc3 fix(client): try to fix the lint issues on the debug page 2025-07-01 01:05:40 +00:00
perf3ct df281f3b26 feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
Jon Fuller 706e20f35c Merge branch 'main' into feat/debug-page 2025-06-30 17:19:31 -07:00
perf3ct 231f88f038 feat(debug): debug page actually works and does something 2025-07-01 00:15:48 +00:00
perf3ct 0052032772 fix(pdf): resolve PDF wordcount error 2025-07-01 00:10:49 +00:00
Jon Fuller b307cfd509 Merge pull request #79 from readur/renovate/playwright-monorepo
chore(deps): update dependency @playwright/test to v1.53.2
2025-06-30 16:31:26 -07:00
Jon Fuller 1fca815485 Merge pull request #80 from readur/renovate/reqwest-0.x-lockfile
fix(deps): update rust crate reqwest to v0.12.21
2025-06-30 16:31:16 -07:00
Jon Fuller c848e899f3 Merge pull request #76 from readur/feat/wrangle-ignored-files
feat(server/client): wrangle ignored files and failed files
2025-06-30 16:31:00 -07:00
perf3ct 830f9d0b38 feat(server): mark documents with 0 words as failed, and fix webdav unit tests 2025-06-30 22:43:25 +00:00
perf3ct 69279344cb fix(tests): fix documents tests 2025-06-30 21:56:21 +00:00
perf3ct 49c34aac5a fix(tests): update frontend tests for new FailedDocument view 2025-06-30 21:20:34 +00:00
renovate[bot] 918a1850cb fix(deps): update rust crate reqwest to v0.12.21 2025-06-30 20:13:56 +00:00
Jon Fuller ca193442da Create codeql.yml 2025-06-30 13:12:28 -07:00
perf3ct b38c1fca07 feat(server): fix serialization issues 2025-06-30 19:40:05 +00:00
perf3ct 9e43df2fbe feat(server/client): add metadata to file view 2025-06-30 19:13:16 +00:00
renovate[bot] 339a603625 chore(deps): update dependency @playwright/test to v1.53.2 2025-06-30 18:56:41 +00:00
perf3ct 1695787f1d feat(client): fix faileddocumentviewer 2025-06-30 00:59:34 +00:00
perf3ct fef28a33c6 feat(server): continue to try to wrangle the failed and ignored documents 2025-06-29 23:27:51 +00:00
perf3ct 87cfab9ff8 fix(tests): resolve compilation error in the multiple OCR functionality 2025-06-29 23:21:42 +00:00
perf3ct 197afc19f4 feat(tests): implement and update tests for multiple OCR languages 2025-06-29 23:03:37 +00:00
renovate[bot] 8e6365c0df chore(deps): update dependency vite to v7 2025-06-29 22:56:05 +00:00
perf3ct 6b6890d529 feat(server/client): support multiple OCR languages 2025-06-29 22:51:06 +00:00
perf3ct e8a7d7bf0f feat(client): rename FailedOcrPage to DocumentManagementPage 2025-06-29 21:48:44 +00:00
perf3ct 4e34992cb5 feat(client): remove demo statement from frontend 2025-06-29 21:44:11 +00:00
Jon Fuller 6aaa3551e9 Merge pull request #73 from readur/feat/better-failure-handling
feat(server/client): add failed_documents table to handle failures, a…
2025-06-28 23:03:23 -07:00
perf3ct d218b4289f fix(tests): fix ocr endpoint tests due to our endpoint change 2025-06-29 05:35:49 +00:00
perf3ct 9e09bc019e fix(tests): fix ocr endpoint tests due to our endpoint change 2025-06-28 23:43:04 +00:00
perf3ct 52b2cd6d88 fix(tests): fix ocr endpoint tests due to our endpoint change 2025-06-28 23:11:47 +00:00
perf3ct 971f9b69e0 fix(tests): fix failing document integration test, again 2025-06-28 22:58:20 +00:00
perf3ct 157181b55b fix(tests): fix failing integration tests, again 2025-06-28 22:54:41 +00:00
perf3ct fbf89c213d fix(tests): resolve a whole lot of test issues 2025-06-28 22:50:40 +00:00
perf3ct edd0c7514f fix(server): resolve compilation errors in constraint_validation.rs 2025-06-28 22:04:01 +00:00
perf3ct 97fa50c1b5 feat(server/client): resolve failing tests 2025-06-28 21:21:05 +00:00
perf3ct 84577806ef feat(server/client): add failed_documents table to handle failures, and move logic of failures 2025-06-28 20:52:58 +00:00
Jon Fuller fce56b660b Merge pull request #72 from readur/feat/better-db-tests
feat(tests): add regression tests and better sql type safety tests
2025-06-28 12:43:52 -07:00