| .. |
|
continuous_text.pdf
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
corrupted_structure.pdf
|
feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again
|
2025-07-14 16:36:32 +00:00 |
|
edge_cases_realistic.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
fake_pdf.pdf
|
feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again
|
2025-07-14 16:36:32 +00:00 |
|
invalid_font_encoding.pdf
|
feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again
|
2025-07-14 16:36:32 +00:00 |
|
large_file.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
malformed_content_stream.pdf
|
feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again
|
2025-07-14 16:36:32 +00:00 |
|
medium_file.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
mixed_content.txt
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
multiline_text.txt
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
multipage_document.pdf
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
multipage_realistic.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
normal_spacing.txt
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
normal_text.pdf
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
oversized_file.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
problematic_encoding.pdf
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |
|
small_file.pdf
|
feat(tests): add more test pdfs
|
2025-07-12 14:04:04 -07:00 |
|
special_chars.txt
|
feat(pdf): implement ocrmypdf to extract text from PDFs
|
2025-07-01 00:56:48 +00:00 |