Commit Graph

339 Commits

Author SHA1 Message Date
perf3ct f4fa33807c
feat(ocr): resolve issue with null bytes coming out of OCR documents 2025-11-02 11:55:27 -08:00
aaldebs99 58652bc300 Feat(UI): hide UI login when ALLOW_LOCAL_AUTH is set to false 2025-10-30 22:10:04 +00:00
perf3ct 65c49ef4f2
feat(ocr): implement new dev stack and allow for more numbers in ocr documents 2025-10-28 14:34:34 -07:00
perf3ct d5963585fd
feat(ocr): soften the requirements around OCR, and update the UI to better handle issues in word count 2025-10-18 14:31:10 -07:00
aaldebs99 d46abbb5a4
Merge pull request #269 from readur/oidc-improvements
Fix(OIDC): redirect to frontend, jwt, and callback handling
2025-10-11 21:08:11 -07:00
aaldebs99 943a3eefae fix(OIDC): redirect to frontend, jwt, and callback handling 2025-10-12 02:15:47 +00:00
aaldebs99 a23edca938 fix(OIDC): redirect to frontend after OIDC credentials 2025-10-12 01:11:47 +00:00
aaldebs99 8900f7d342
Merge pull request #268 from readur/oidc-improvements
fix(OIDC): handle confidential client providers
2025-10-11 17:42:01 -07:00
aaldebs99 a5edcfdd1d fix(OIDC): handle confidential client providers 2025-10-12 00:05:06 +00:00
aaldebs99 e81c60192b
Merge pull request #267 from readur/oidc-improvements
WIP: OIDC improvements
2025-10-11 16:16:11 -07:00
aaldebs99 de785f7745 fix(tests): so... many... 2025-10-11 20:31:02 +00:00
aaldebs99 64a66b987f fix(tests): make those optional too 2025-10-11 19:30:17 +00:00
aaldebs99 1a7e6d7fa9 refactor(config): make oidc_auto_register and allow_local_auth optional 2025-10-11 19:07:18 +00:00
aaldebs99 f7d6eeac6f fix(unit-tests): add missing fields to unit tests 2025-10-11 16:21:26 +00:00
aaldebs99 0032a30bb1 feat(oidc): add option for auto-register, local login, and user matching by email 2025-10-11 04:55:19 +00:00
perf3ct e7574cb0da
feat(ui): handle strange responses that the UI could receive 2025-10-05 13:45:10 -07:00
perf3ct c6fc8cbb3a
feat(tests): also add config tests 2025-09-24 20:07:41 +00:00
perf3ct b435437ad3
feat(server): also support individual DB environment variables instead of DATABASE_URL 2025-09-24 19:53:20 +00:00
perf3ct aa5bd77753
feat(webdav): get rid of complex loop detection 2025-09-09 02:11:57 +00:00
perf3ct 88c376f655
feat(webdav): add some stress test utilities 2025-09-09 01:38:36 +00:00
perf3ct 7863b9100f
feat(ocr): no longer add explicit section / page break 2025-09-05 00:06:09 +00:00
perf3ct 43b679f59b
fix(server): resolve compilation warnings and fix test that expects no pass, to have it actually expect pass 2025-09-02 22:51:17 +00:00
perf3ct 90be003874
feat(db): add more guardrails for null bytes 2025-09-02 21:26:03 +00:00
perf3ct 483d89132f
feat(office): add documentation around using antiword/catdoc for `doc` functionality 2025-09-02 20:29:17 +00:00
perf3ct 149c3b9a3f
feat(office): yeet unused fallback strategy 2025-09-02 03:47:20 +00:00
perf3ct d5d6d2edb4
feat(office): xml extraction seems to work now 2025-09-02 01:22:19 +00:00
perf3ct 774efd1140
refactor(server): remove XML vs library comparison functionality
Remove all comparison-related code used to evaluate XML vs library-based
Office document extraction. The XML approach has proven superior, so the
comparison functionality is no longer needed.

Changes:
- Remove extraction_comparator.rs (entire comparison engine)
- Remove test_extraction_comparison.rs binary
- Remove comparison mode logic from enhanced.rs
- Simplify fallback_strategy.rs to use XML extraction only
- Update OCR service to use XML extraction as primary method
- Clean up database migration to remove comparison-specific settings
- Remove test_extraction binary from Cargo.toml
- Update integration tests to work with simplified extraction

The Office document extraction now flows directly to XML-based extraction
without any comparison checks, maintaining the superior extraction quality
while removing unnecessary complexity.
2025-09-02 01:22:19 +00:00
perf3ct 73525eca02
feat(office): add library-based and xml-based parsing 2025-09-02 00:25:06 +00:00
perf3ct 57a5d2ab15
feat(office): add xml parsing 2025-09-01 22:32:42 +00:00
perf3ct b8bf7c9585
feat(office): use catdoc and antiword to convert doc 2025-09-01 21:49:30 +00:00
perf3ct 78af7e7861
feat(office): use actual packages for extraction 2025-09-01 21:21:22 +00:00
perf3ct 546b41b462
feat(office): try to resolve docx/doc not working 2025-09-01 19:58:06 +00:00
perf3ct 4dbd1aa5d6
fix(errors): resolve the sql casting, and introduce unit test to prevent this from happening again 2025-09-01 18:15:52 +00:00
perf3ct 10d461aeac
fix(errors): resolve issues with error handling 2025-09-01 18:01:36 +00:00
perf3ct f6eb7ba49f
feat(metrics): try to simplify webdav metrics some 2025-08-23 22:17:40 +00:00
perf3ct 4b5ee94724
fix(metrics): casting is the name of the game 2025-08-23 20:31:56 +00:00
perf3ct 1b4573f658
feat(webdav): resolve failing migration tests, and implement better error handling 2025-08-23 18:52:52 +00:00
perf3ct 00795ace02
feat(webdav): fix all the wonderful compilation issues 2025-08-21 05:07:28 +00:00
perf3ct 18832b9c12
feat(webdav): fix all the wonderful compilation issues 2025-08-21 04:29:36 +00:00
perf3ct b7dd64c8f6
feat(webdav): try to do better webdav errors to not slam webdav endpoints 2025-08-20 21:59:14 +00:00
perf3ct d793509af9
feat(source): update names of sourceerror, and update tests 2025-08-17 22:37:41 +00:00
perf3ct 6a64d9e6ed
feat(source): implement generic "SourceError" and then have it be propagated as "WebDAVerror", etc. 2025-08-17 22:05:58 +00:00
perf3ct cddba50799
feat(webdav): webdav error management and tests 2025-08-17 20:16:46 +00:00
perf3ct 93c2863d01
feat(webdav): support capturing individual directory errors in webdav 2025-08-14 16:24:05 +00:00
perf3ct 67ae68745c
fix(dev): remove unneeded docs 2025-08-13 20:51:13 +00:00
perf3ct caf4e7cf7d
feat(docs): update docs for S3 backend implementation 2025-08-13 20:24:59 +00:00
perf3ct 4b6e0820b7
feat(websocket): update websockets and websocket tests so that they actually pass 2025-08-11 20:08:36 +00:00
perf3ct 0fb250e28c
feat(security): this was just pain 2025-08-11 01:13:29 +00:00
perf3ct 080263a9ac
fix(tests): resolve issues with s3 tests 2025-08-11 00:54:09 +00:00
perf3ct cb3b3f05b8
fix(tests): migrate auto resume tests to use new test app state management 2025-08-02 18:29:12 +00:00