Commit Graph

938 Commits

Author SHA1 Message Date
perf3ct 7cf1fd623c
feat(ci): try to prepull containers 2025-09-02 22:05:02 +00:00
perf3ct 90be003874
feat(db): add more guardrails for null bytes 2025-09-02 21:26:03 +00:00
perf3ct 11ffe9d0e5
feat(ci): add dockerhub auth 2025-09-02 21:21:37 +00:00
perf3ct 483d89132f
feat(office): add documentation around using antiword/catdoc for `doc` functionality 2025-09-02 20:29:17 +00:00
perf3ct 149c3b9a3f
feat(office): yeet unused fallback strategy 2025-09-02 03:47:20 +00:00
perf3ct d5d6d2edb4
feat(office): xml extraction seems to work now 2025-09-02 01:22:19 +00:00
perf3ct 774efd1140
refactor(server): remove XML vs library comparison functionality
Remove all comparison-related code used to evaluate XML vs library-based
Office document extraction. The XML approach has proven superior, so the
comparison functionality is no longer needed.

Changes:
- Remove extraction_comparator.rs (entire comparison engine)
- Remove test_extraction_comparison.rs binary
- Remove comparison mode logic from enhanced.rs
- Simplify fallback_strategy.rs to use XML extraction only
- Update OCR service to use XML extraction as primary method
- Clean up database migration to remove comparison-specific settings
- Remove test_extraction binary from Cargo.toml
- Update integration tests to work with simplified extraction

The Office document extraction now flows directly to XML-based
extraction
without any comparison checks, maintaining the superior extraction
quality
while removing unnecessary complexity.
2025-09-02 01:22:19 +00:00
perf3ct 73525eca02
feat(office): add library-based and xml-based parsing 2025-09-02 00:25:06 +00:00
perf3ct 57a5d2ab15
feat(office): add xml parsing 2025-09-01 22:32:42 +00:00
perf3ct 325731aa04
feat(office): create legitimate office files for testing 2025-09-01 22:07:59 +00:00
perf3ct b8bf7c9585
feat(office): use catdoc and antiword to convert doc 2025-09-01 21:49:30 +00:00
perf3ct 78af7e7861
feat(office): use actual packages for extraction 2025-09-01 21:21:22 +00:00
perf3ct 546b41b462
feat(office): try to resolve docx/doc not working 2025-09-01 19:58:06 +00:00
perf3ct 4dbd1aa5d6
fix(errors): resolve the sql casting, and introduce unit test to prevent this from happening again 2025-09-01 18:15:52 +00:00
perf3ct 10d461aeac
fix(errors): resolve issues with error handling 2025-09-01 18:01:36 +00:00
Jon Fuller dce2517d10
Update README.md 2025-08-25 18:25:38 -07:00
Jon Fuller 752b115fb5
Merge pull request #188 from readur/renovate/alpine-helm-3.x
chore(deps): update alpine/helm docker tag to v3.18.6
2025-08-25 13:07:49 -07:00
Jon Fuller 99f5d5ffd3
Merge pull request #189 from readur/renovate/wiremock-0.x-lockfile
chore(deps): update rust crate wiremock to v0.6.5
2025-08-25 13:07:38 -07:00
Jon Fuller a4ce03c2b5
Merge pull request #190 from readur/renovate/regex-1.x-lockfile
fix(deps): update rust crate regex to v1.11.2
2025-08-25 13:06:21 -07:00
Jon Fuller aff48568ed
Merge pull request #191 from readur/renovate/url-2.x-lockfile
fix(deps): update rust crate url to v2.5.7
2025-08-25 13:06:08 -07:00
renovate[bot] 587a36215a
fix(deps): update rust crate url to v2.5.7 2025-08-25 04:59:03 +00:00
renovate[bot] 4c0179ede8
fix(deps): update rust crate regex to v1.11.2 2025-08-25 04:58:55 +00:00
renovate[bot] 8059b39b44
chore(deps): update rust crate wiremock to v0.6.5 2025-08-25 03:35:44 +00:00
Jon Fuller f66ad3a4e0
Merge pull request #184 from readur/renovate/material-ui-monorepo
fix(deps): update dependency @mui/x-date-pickers to v8.10.2
2025-08-24 20:34:37 -07:00
renovate[bot] f1e5455a31
chore(deps): update alpine/helm docker tag to v3.18.6 2025-08-25 03:34:32 +00:00
Jon Fuller 9ba5fa16ba
Merge pull request #183 from readur/renovate/react-monorepo
chore(deps): update dependency @types/react to v19.1.11
2025-08-24 20:34:26 -07:00
Jon Fuller cca882c444
Merge pull request #185 from readur/renovate/react-router-monorepo
fix(deps): update dependency react-router-dom to v7.8.2
2025-08-24 20:34:13 -07:00
Jon Fuller a2196b3366
Merge pull request #186 from readur/renovate/serde_json-1.x-lockfile
fix(deps): update rust crate serde_json to v1.0.143
2025-08-24 20:33:59 -07:00
perf3ct 8f9c7ea6c4
feat(mkdocs): resolve silly sidebar showing on mobile 2025-08-24 18:21:55 +00:00
Jon Fuller 40dcaab0f8
Merge pull request #187 from readur/feat/simplify-webdav-metrics
feat(metrics): try to simplify webdav metrics some
2025-08-23 16:24:03 -07:00
perf3ct b5b5019eba
fix(tests): resolve compilation error in last unit test 2025-08-23 22:48:41 +00:00
perf3ct 07b52ab28a
feat(metrics): update appstate for metrics changes 2025-08-23 22:27:50 +00:00
perf3ct f6eb7ba49f
feat(metrics): try to simplify webdav metrics some 2025-08-23 22:17:40 +00:00
renovate[bot] cbb7cac0eb
fix(deps): update rust crate serde_json to v1.0.143 2025-08-23 21:53:31 +00:00
renovate[bot] 77b3b108bd
fix(deps): update dependency react-router-dom to v7.8.2 2025-08-23 21:52:56 +00:00
Jon Fuller 83ee0ce9a5
Merge pull request #182 from readur/feat/better-webdav-errors
feat(webdav): try to do better webdav errors to not slam webdav endpoints
2025-08-23 14:52:13 -07:00
perf3ct 4b5ee94724
fix(metrics): casting is the name of the game 2025-08-23 20:31:56 +00:00
renovate[bot] 2c00b7ef0c
fix(deps): update dependency @mui/x-date-pickers to v8.10.2 2025-08-23 19:20:55 +00:00
renovate[bot] 01405c137d
chore(deps): update dependency @types/react to v19.1.11 2025-08-23 19:20:47 +00:00
Jon Fuller 5d0f1ac1bd
Merge pull request #174 from readur/renovate/thiserror-2.x-lockfile
fix(deps): update rust crate thiserror to v2.0.16
2025-08-23 12:19:27 -07:00
Jon Fuller bc54806d0b
Merge pull request #179 from readur/renovate/bcrypt-0.x-lockfile
fix(deps): update rust crate bcrypt to v0.17.1
2025-08-23 12:19:18 -07:00
Jon Fuller ce3c3b8ad2
Merge pull request #181 from readur/renovate/vite-7.x-lockfile
chore(deps): update dependency vite to v7.1.3
2025-08-23 12:19:09 -07:00
Jon Fuller 1fa7179fbd
Merge pull request #180 from readur/renovate/vitejs-plugin-react-5.x-lockfile
chore(deps): update dependency @vitejs/plugin-react to v5.0.1
2025-08-23 12:19:01 -07:00
perf3ct 1b4573f658
feat(webdav): resolve failing migration tests, and implement better error handling 2025-08-23 18:52:52 +00:00
perf3ct 08cce05d1a
feat(tests): resolve issues with db macros in test compilation 2025-08-21 21:08:05 +00:00
perf3ct d02180cc4e
feat(tests): resolve issues with db macros in test compilation 2025-08-21 19:42:23 +00:00
perf3ct 00795ace02
feat(webdav): fix all the wonderful compilation issues 2025-08-21 05:07:28 +00:00
perf3ct 18832b9c12
feat(webdav): fix all the wonderful compilation issues 2025-08-21 04:29:36 +00:00
perf3ct b7dd64c8f6
feat(webdav): try to do better webdav errors to not slam webdav endpoints 2025-08-20 21:59:14 +00:00
renovate[bot] 5600f083b9
fix(deps): update rust crate thiserror to v2.0.16 2025-08-20 07:05:10 +00:00