Readur/tests/test_pdfs
perf3ct 9c051b6f55
feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again
2025-07-14 16:36:32 +00:00
..
continuous_text.pdf feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
corrupted_structure.pdf feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again 2025-07-14 16:36:32 +00:00
edge_cases_realistic.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
fake_pdf.pdf feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again 2025-07-14 16:36:32 +00:00
invalid_font_encoding.pdf feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again 2025-07-14 16:36:32 +00:00
large_file.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
malformed_content_stream.pdf feat(ocr): gracefully handle problematic PDFs in all the ways, create tests so that it doesn't happen again 2025-07-14 16:36:32 +00:00
medium_file.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
mixed_content.txt feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
multiline_text.txt feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
multipage_document.pdf feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
multipage_realistic.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
normal_spacing.txt feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
normal_text.pdf feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
oversized_file.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
problematic_encoding.pdf feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00
small_file.pdf feat(tests): add more test pdfs 2025-07-12 14:04:04 -07:00
special_chars.txt feat(pdf): implement ocrmypdf to extract text from PDFs 2025-07-01 00:56:48 +00:00