MiniPdf vs Reference PDF Comparison Report
April 30, 2026
Generated: 2026-04-30T11:58:42.188069
Summary
| # | Test Case | Valid | Text Sim | Visual Avg | Pages (M/R) | Overall |
|---|---|---|---|---|---|---|
| 1 | 🟢 docx_classic01_single_paragraph | ✅ | 1.0 | 0.997 | 1/1 | 0.9988 |
| 2 | 🟢 docx_classic02_multiple_paragraphs | ✅ | 1.0 | 0.9831 | 1/1 | 0.9932 |
| 3 | 🟢 docx_classic03_headings | ✅ | 1.0 | 0.9916 | 1/1 | 0.9966 |
| 4 | 🟢 docx_classic04_bold_italic | ✅ | 1.0 | 0.9982 | 1/1 | 0.9993 |
| 5 | 🟢 docx_classic05_font_sizes | ✅ | 1.0 | 0.9902 | 1/1 | 0.9961 |
| 6 | 🟢 docx_classic06_font_colors | ✅ | 1.0 | 0.9991 | 1/1 | 0.9996 |
| 7 | 🟢 docx_classic07_alignment | ✅ | 1.0 | 0.9887 | 1/1 | 0.9955 |
| 8 | 🟢 docx_classic08_bullet_list | ✅ | 1.0 | 0.9967 | 1/1 | 0.9987 |
| 9 | 🟢 docx_classic09_numbered_list | ✅ | 1.0 | 0.9944 | 1/1 | 0.9978 |
| 10 | 🟢 docx_classic10_simple_table | ✅ | 1.0 | 0.9905 | 1/1 | 0.9962 |
| 11 | 🟢 docx_classic11_table_with_shading | ✅ | 1.0 | 0.9797 | 1/1 | 0.9919 |
| 12 | 🟢 docx_classic12_merged_cells_table | ✅ | 1.0 | 0.9919 | 1/1 | 0.9968 |
| 13 | 🟡 docx_classic13_long_document | ✅ | 1.0 | 0.6873 | 3/4 | 0.7749 |
| 14 | 🟢 docx_classic14_mixed_content | ✅ | 1.0 | 0.9761 | 1/1 | 0.9904 |
| 15 | 🟢 docx_classic15_indentation | ✅ | 1.0 | 0.983 | 1/1 | 0.9932 |
| 16 | 🟢 docx_classic16_line_spacing | ✅ | 1.0 | 0.9826 | 1/1 | 0.993 |
| 17 | 🟢 docx_classic17_page_break | ✅ | 1.0 | 0.9978 | 3/3 | 0.9991 |
| 18 | 🟢 docx_classic18_embedded_image | ✅ | 1.0 | 0.9934 | 1/1 | 0.9974 |
| 19 | 🟢 docx_classic19_multiple_images | ✅ | 1.0 | 0.9885 | 1/1 | 0.9954 |
| 20 | 🟢 docx_classic20_table_with_many_rows | ✅ | 1.0 | 0.9652 | 1/1 | 0.9861 |
| 21 | 🟢 docx_classic21_nested_lists | ✅ | 1.0 | 0.9932 | 1/1 | 0.9973 |
| 22 | 🟢 docx_classic22_horizontal_rule | ✅ | 1.0 | 0.9929 | 1/1 | 0.9972 |
| 23 | 🟢 docx_classic23_mixed_formatting_runs | ✅ | 1.0 | 0.9968 | 1/1 | 0.9987 |
| 24 | 🟢 docx_classic24_two_column_table_layout | ✅ | 0.7907 | 0.9924 | 1/1 | 0.9132 |
| 25 | 🟢 docx_classic25_title_and_subtitle | ✅ | 1.0 | 0.9938 | 2/2 | 0.9975 |
| 26 | 🟢 docx_classic26_table_alignment | ✅ | 1.0 | 0.9914 | 1/1 | 0.9966 |
| 27 | 🟢 docx_classic27_long_paragraph_wrapping | ✅ | 1.0 | 0.9358 | 1/1 | 0.9743 |
| 28 | 🟢 docx_classic28_special_characters | ✅ | 1.0 | 0.9927 | 1/1 | 0.9971 |
| 29 | 🟢 docx_classic29_table_with_image | ✅ | 0.9728 | 0.9899 | 1/1 | 0.9851 |
| 30 | 🟢 docx_classic30_comprehensive_report | ✅ | 1.0 | 0.7788 | 3/3 | 0.9115 |
| 31 | 🟢 docx_classic31_product_card_with_image | ✅ | 1.0 | 0.9776 | 1/1 | 0.991 |
| 32 | 🟢 docx_classic31_strikethrough_text | ✅ | 1.0 | 0.9917 | 1/1 | 0.9967 |
| 33 | 🟢 docx_classic32_company_logo_header | ✅ | 1.0 | 0.9546 | 1/1 | 0.9818 |
| 34 | 🟢 docx_classic32_superscript_subscript | ✅ | 0.9762 | 0.9958 | 1/1 | 0.9888 |
| 35 | 🟢 docx_classic33_highlighted_text | ✅ | 1.0 | 0.9464 | 1/1 | 0.9786 |
| 36 | 🟢 docx_classic33_two_products_side_by_side | ✅ | 1.0 | 0.9794 | 1/1 | 0.9918 |
| 37 | 🔴 docx_classic34_employee_directory_with_photo | ✅ | 1.0 | 0.4892 | 1/2 | 0.6957 |
| 38 | 🟢 docx_classic34_paragraph_borders | ✅ | 1.0 | 0.9809 | 1/1 | 0.9924 |
| 39 | 🟢 docx_classic35_inventory_with_product_photos | ✅ | 1.0 | 0.9783 | 2/2 | 0.9913 |
| 40 | 🟡 docx_classic35_tab_stops | ✅ | 0.64 | 0.9903 | 1/1 | 0.8521 |
| 41 | 🟢 docx_classic36_invoice_with_logo | ✅ | 1.0 | 0.964 | 1/1 | 0.9856 |
| 42 | 🟢 docx_classic36_wide_table | ✅ | 1.0 | 0.979 | 1/1 | 0.9916 |
| 43 | 🟢 docx_classic37_nested_table | ✅ | 0.963 | 0.9913 | 1/1 | 0.9817 |
| 44 | 🟢 docx_classic37_real_estate_listing | ✅ | 1.0 | 0.9754 | 1/1 | 0.9902 |
| 45 | 🟢 docx_classic38_restaurant_menu_with_photos | ✅ | 1.0 | 0.9748 | 2/2 | 0.9899 |
| 46 | 🟢 docx_classic38_table_column_widths | ✅ | 0.8485 | 0.9821 | 1/1 | 0.9322 |
| 47 | 🟢 docx_classic39_cover_page_with_image | ✅ | 1.0 | 0.9866 | 2/2 | 0.9946 |
| 48 | 🟢 docx_classic39_financial_report | ✅ | 1.0 | 0.9783 | 1/1 | 0.9913 |
| 49 | 🟢 docx_classic40_product_catalog_with_images | ✅ | 1.0 | 0.9729 | 1/1 | 0.9892 |
| 50 | 🟢 docx_classic40_resume | ✅ | 1.0 | 0.9745 | 1/1 | 0.9898 |
| 51 | 🟢 docx_classic41_business_letter | ✅ | 0.9893 | 0.9866 | 1/1 | 0.9904 |
| 52 | 🟢 docx_classic41_newsletter_with_hero_image | ✅ | 1.0 | 0.9217 | 1/1 | 0.9687 |
| 53 | 🟢 docx_classic42_chart_image_with_data | ✅ | 1.0 | 0.9755 | 1/1 | 0.9902 |
| 54 | 🟢 docx_classic42_meeting_minutes | ✅ | 0.9903 | 0.9703 | 1/1 | 0.9842 |
| 55 | 🟢 docx_classic43_event_flyer_with_banner | ✅ | 1.0 | 0.9195 | 1/1 | 0.9678 |
| 56 | 🟢 docx_classic43_invoice_document | ✅ | 0.9775 | 0.949 | 1/1 | 0.9706 |
| 57 | 🟢 docx_classic44_dashboard_with_kpi_image | ✅ | 1.0 | 0.9745 | 1/1 | 0.9898 |
| 58 | 🟢 docx_classic44_memo | ✅ | 1.0 | 0.9768 | 1/1 | 0.9907 |
| 59 | 🟢 docx_classic45_certificate_with_seal | ✅ | 1.0 | 0.9778 | 1/1 | 0.9911 |
| 60 | 🟢 docx_classic45_project_plan | ✅ | 1.0 | 0.9546 | 1/1 | 0.9818 |
| 61 | 🟢 docx_classic46_comparison_table | ✅ | 1.0 | 0.9777 | 1/1 | 0.9911 |
| 62 | 🟢 docx_classic46_product_image_grid | ✅ | 1.0 | 0.9801 | 1/1 | 0.992 |
| 63 | 🟢 docx_classic47_data_dictionary | ✅ | 1.0 | 0.9511 | 1/1 | 0.9804 |
| 64 | 🟢 docx_classic47_news_article_with_hero_image | ✅ | 1.0 | 0.9222 | 1/1 | 0.9689 |
| 65 | 🟢 docx_classic48_multi_level_headings | ✅ | 1.0 | 0.9766 | 3/3 | 0.9906 |
| 66 | 🟢 docx_classic48_task_list_with_status_icons | ✅ | 1.0 | 0.9743 | 1/1 | 0.9897 |
| 67 | 🟢 docx_classic49_cjk_document | ✅ | 0.9849 | 0.9733 | 1/1 | 0.9833 |
| 68 | 🟢 docx_classic49_wide_panoramic_banner | ✅ | 1.0 | 0.8986 | 1/1 | 0.9594 |
| 69 | 🟡 docx_classic50_long_table_with_formatting | ✅ | 0.7465 | 0.634 | 2/2 | 0.7522 |
| 70 | 🟢 docx_classic50_portrait_tall_image | ✅ | 1.0 | 0.9808 | 1/1 | 0.9923 |
| 71 | 🟢 docx_classic51_step_by_step_with_images | ✅ | 1.0 | 0.9758 | 1/1 | 0.9903 |
| 72 | 🟢 docx_classic51_underline_styles | ✅ | 1.0 | 0.9947 | 1/1 | 0.9979 |
| 73 | 🟢 docx_classic52_before_after_comparison | ✅ | 1.0 | 0.9726 | 1/1 | 0.989 |
| 74 | 🟢 docx_classic52_spacing_before_after | ✅ | 1.0 | 0.9751 | 1/1 | 0.99 |
| 75 | 🟢 docx_classic53_color_swatch_palette | ✅ | 1.0 | 0.9723 | 1/1 | 0.9889 |
| 76 | 🟢 docx_classic53_table_merged_complex | ✅ | 1.0 | 0.9792 | 1/1 | 0.9917 |
| 77 | 🟢 docx_classic54_multi_font_family | ✅ | 1.0 | 0.9859 | 1/1 | 0.9944 |
| 78 | 🟢 docx_classic54_travel_destination_cards | ✅ | 1.0 | 0.9799 | 1/1 | 0.992 |
| 79 | 🟢 docx_classic55_background_shading_paragraph | ✅ | 1.0 | 0.9366 | 1/1 | 0.9746 |
| 80 | 🟢 docx_classic55_lab_results_with_image | ✅ | 1.0 | 0.9545 | 1/1 | 0.9818 |
| 81 | 🟢 docx_classic56_images_and_tables_mixed | ✅ | 1.0 | 0.952 | 1/1 | 0.9808 |
| 82 | 🟢 docx_classic56_software_screenshot_features | ✅ | 1.0 | 0.9692 | 1/1 | 0.9877 |
| 83 | 🟢 docx_classic57_right_to_left_text | ✅ | 0.9328 | 0.9897 | 1/1 | 0.969 |
| 84 | 🟢 docx_classic57_sports_results_with_logos | ✅ | 1.0 | 0.9531 | 1/1 | 0.9812 |
| 85 | 🟡 docx_classic58_dense_paragraph_document | ✅ | 1.0 | 0.7725 | 6/7 | 0.809 |
| 86 | 🟢 docx_classic58_report_with_footer_image | ✅ | 1.0 | 0.9544 | 1/1 | 0.9818 |
| 87 | 🟢 docx_classic59_numbered_and_bullet_mixed | ✅ | 0.9948 | 0.9816 | 1/1 | 0.9906 |
| 88 | 🟢 docx_classic59_nutrition_label_with_image | ✅ | 1.0 | 0.9718 | 1/1 | 0.9887 |
| 89 | 🟢 docx_classic60_comprehensive_styled_report | ✅ | 1.0 | 0.9579 | 3/3 | 0.9832 |
| 90 | 🟢 docx_classic60_project_status_with_milestones | ✅ | 1.0 | 0.9734 | 2/2 | 0.9894 |
| 91 | 🟢 docx_classic61_header_and_footer | ✅ | 1.0 | 0.9662 | 1/1 | 0.9865 |
| 92 | 🟢 docx_classic62_footnote_references | ✅ | 1.0 | 0.9763 | 1/1 | 0.9905 |
| 93 | 🟢 docx_classic63_toc_style_headings | ✅ | 1.0 | 0.9804 | 1/1 | 0.9922 |
| 94 | 🟢 docx_classic64_multi_column_layout | ✅ | 0.8494 | 0.9755 | 1/1 | 0.93 |
| 95 | 🟢 docx_classic65_code_block_styling | ✅ | 1.0 | 0.8727 | 1/1 | 0.9491 |
| 96 | 🟢 docx_classic66_colored_title_page | ✅ | 1.0 | 0.9709 | 1/1 | 0.9884 |
| 97 | 🟢 docx_classic67_alternating_row_table | ✅ | 0.9778 | 0.9277 | 1/1 | 0.9622 |
| 98 | 🟢 docx_classic68_sidebar_layout | ✅ | 0.9091 | 0.9578 | 1/1 | 0.9468 |
| 99 | 🟢 docx_classic69_blockquote_styling | ✅ | 1.0 | 0.9769 | 1/1 | 0.9908 |
| 100 | 🟢 docx_classic70_academic_paper | ✅ | 0.9924 | 0.9463 | 1/1 | 0.9755 |
| 101 | 🟢 docx_classic71_legal_document | ✅ | 1.0 | 0.9722 | 1/1 | 0.9889 |
| 102 | 🟢 docx_classic72_technical_specification | ✅ | 1.0 | 0.9368 | 1/1 | 0.9747 |
| 103 | 🟢 docx_classic73_calendar_layout | ✅ | 1.0 | 0.9512 | 1/1 | 0.9805 |
| 104 | 🟢 docx_classic74_org_chart | ✅ | 1.0 | 0.9177 | 1/1 | 0.9671 |
| 105 | 🟢 docx_classic75_newsletter_layout | ✅ | 0.9857 | 0.9726 | 1/1 | 0.9833 |
| 106 | 🟢 docx_classic76_recipe_card | ✅ | 1.0 | 0.9654 | 1/1 | 0.9862 |
| 107 | 🟢 docx_classic77_timeline_layout | ✅ | 1.0 | 0.929 | 1/1 | 0.9716 |
| 108 | 🟢 docx_classic78_faq_document | ✅ | 1.0 | 0.9527 | 2/2 | 0.9811 |
| 109 | 🟢 docx_classic79_glossary | ✅ | 1.0 | 0.969 | 1/1 | 0.9876 |
| 110 | 🟢 docx_classic80_matrix_grid | ✅ | 0.9737 | 0.9595 | 1/1 | 0.9733 |
| 111 | 🟢 docx_classic81_budget_table | ✅ | 1.0 | 0.9228 | 1/1 | 0.9691 |
| 112 | 🟢 docx_classic82_survey_questionnaire | ✅ | 0.9883 | 0.9575 | 2/2 | 0.9783 |
| 113 | 🟢 docx_classic83_medical_form | ✅ | 1.0 | 0.9441 | 1/1 | 0.9776 |
| 114 | 🟢 docx_classic84_shipping_label | ✅ | 1.0 | 0.9205 | 1/1 | 0.9682 |
| 115 | 🟢 docx_classic85_report_card | ✅ | 1.0 | 0.9347 | 1/1 | 0.9739 |
| 116 | 🟢 docx_classic86_checklist_document | ✅ | 1.0 | 0.9714 | 1/1 | 0.9886 |
| 117 | 🟢 docx_classic87_bibliography | ✅ | 1.0 | 0.9746 | 1/1 | 0.9898 |
| 118 | 🟡 docx_classic88_presentation_handout | ✅ | 1.0 | 0.6971 | 2/2 | 0.8788 |
| 119 | 🟢 docx_classic89_multi_image_gallery | ✅ | 1.0 | 0.9495 | 1/1 | 0.9798 |
| 120 | 🟢 docx_classic90_comprehensive_annual_report | ✅ | 1.0 | 0.9606 | 4/4 | 0.9842 |
| 121 | 🟢 docx_classic91_landscape_page | ✅ | 1.0 | 0.9776 | 1/1 | 0.991 |
| 122 | 🟢 docx_classic92_first_line_indent | ✅ | 0.9868 | 0.9635 | 1/1 | 0.9801 |
| 123 | 🟢 docx_classic93_hanging_indent | ✅ | 1.0 | 0.9574 | 1/1 | 0.983 |
| 124 | 🟢 docx_classic94_custom_bullet_characters | ✅ | 1.0 | 0.9851 | 1/1 | 0.994 |
| 125 | 🟢 docx_classic95_contract_template | ✅ | 1.0 | 0.9813 | 1/1 | 0.9925 |
| 126 | 🟢 docx_classic96_dense_data_table | ✅ | 1.0 | 0.8705 | 1/1 | 0.9482 |
| 127 | 🟢 docx_classic97_product_catalog | ✅ | 1.0 | 0.9367 | 1/1 | 0.9747 |
| 128 | 🟢 docx_classic98_training_manual | ✅ | 1.0 | 0.962 | 1/1 | 0.9848 |
| 129 | 🟢 docx_classic99_policy_document | ✅ | 1.0 | 0.9494 | 1/1 | 0.9798 |
| 130 | 🟢 docx_classic100_multi_page_table | ✅ | 0.9932 | 0.8609 | 2/2 | 0.9416 |
| 131 | 🟢 docx_classic101_warranty_document | ✅ | 1.0 | 0.963 | 1/1 | 0.9852 |
| 132 | 🟢 docx_classic102_curriculum_syllabus | ✅ | 0.9961 | 0.9448 | 1/1 | 0.9764 |
| 133 | 🟢 docx_classic103_event_program | ✅ | 0.9917 | 0.9646 | 2/2 | 0.9825 |
| 134 | 🟢 docx_classic104_sop_document | ✅ | 0.9915 | 0.9759 | 2/2 | 0.987 |
| 135 | 🟢 docx_classic105_certificate | ✅ | 1.0 | 0.9717 | 1/1 | 0.9887 |
| 136 | 🟢 docx_classic106_multi_section_orientation | ✅ | 0.9933 | 0.9772 | 3/3 | 0.9882 |
| 137 | 🟢 docx_classic107_order_form | ✅ | 0.9859 | 0.94 | 1/1 | 0.9704 |
| 138 | 🟢 docx_classic108_comparison_matrix | ✅ | 0.9753 | 0.9626 | 1/1 | 0.9752 |
| 139 | 🟢 docx_classic109_release_notes | ✅ | 1.0 | 0.9695 | 1/1 | 0.9878 |
| 140 | 🟢 docx_classic110_troubleshooting_guide | ✅ | 1.0 | 0.9713 | 1/1 | 0.9885 |
| 141 | 🟢 docx_classic111_meeting_agenda | ✅ | 0.9912 | 0.9646 | 1/1 | 0.9823 |
| 142 | 🟢 docx_classic112_project_status_report | ✅ | 0.9798 | 0.9566 | 1/1 | 0.9746 |
| 143 | 🟢 docx_classic113_address_labels | ✅ | 1.0 | 0.9795 | 1/1 | 0.9918 |
| 144 | 🟢 docx_classic114_test_report | ✅ | 1.0 | 0.9472 | 1/1 | 0.9789 |
| 145 | 🟢 docx_classic115_price_list | ✅ | 0.9709 | 0.9118 | 1/1 | 0.9531 |
| 146 | 🟢 docx_classic116_risk_assessment | ✅ | 1.0 | 0.939 | 1/1 | 0.9756 |
| 147 | 🟢 docx_classic117_employee_handbook_excerpt | ✅ | 1.0 | 0.9525 | 1/1 | 0.981 |
| 148 | 🟢 docx_classic118_data_report_with_summary | ✅ | 1.0 | 0.9337 | 1/1 | 0.9735 |
| 149 | 🟢 docx_classic119_multi_language_document | ✅ | 0.9876 | 0.9692 | 1/1 | 0.9827 |
| 150 | 🟢 docx_classic120_comprehensive_business_proposal | ✅ | 0.9918 | 0.9623 | 3/3 | 0.9816 |
| 151 | 🟢 docx_classic121_thin_border_table | ✅ | 1.0 | 0.9867 | 1/1 | 0.9947 |
| 152 | 🟢 docx_classic122_thick_outer_border_table | ✅ | 1.0 | 0.9849 | 1/1 | 0.994 |
| 153 | 🟢 docx_classic123_dashed_border_table | ✅ | 1.0 | 0.9922 | 1/1 | 0.9969 |
| 154 | 🟢 docx_classic124_colored_border_table | ✅ | 1.0 | 0.9847 | 1/1 | 0.9939 |
| 155 | 🟢 docx_classic125_solid_cell_fills | ✅ | 1.0 | 0.9725 | 1/1 | 0.989 |
| 156 | 🟢 docx_classic126_dark_header_table | ✅ | 1.0 | 0.9803 | 1/1 | 0.9921 |
| 157 | 🟢 docx_classic127_font_styles_showcase | ✅ | 1.0 | 0.9819 | 1/1 | 0.9928 |
| 158 | 🟢 docx_classic128_font_sizes_showcase | ✅ | 1.0 | 0.9733 | 1/1 | 0.9893 |
| 159 | 🟢 docx_classic129_alignment_combinations | ✅ | 1.0 | 0.9882 | 1/1 | 0.9953 |
| 160 | 🟢 docx_classic130_wrap_and_indent | ✅ | 1.0 | 0.9824 | 1/1 | 0.993 |
| 161 | 🟢 docx_classic131_number_format_table | ✅ | 1.0 | 0.9797 | 1/1 | 0.9919 |
| 162 | 🟢 docx_classic132_striped_table | ✅ | 1.0 | 0.9339 | 1/1 | 0.9736 |
| 163 | 🟢 docx_classic133_gradient_rows_table | ✅ | 1.0 | 0.9625 | 1/1 | 0.985 |
| 164 | 🟢 docx_classic134_heatmap_table | ✅ | 1.0 | 0.9646 | 1/1 | 0.9858 |
| 165 | 🟢 docx_classic135_bottom_border_paragraphs | ✅ | 1.0 | 0.9803 | 1/1 | 0.9921 |
| 166 | 🟢 docx_classic136_financial_statement | ✅ | 1.0 | 0.9675 | 1/1 | 0.987 |
| 167 | 🟢 docx_classic137_checkerboard_table | ✅ | 1.0 | 0.9571 | 1/1 | 0.9828 |
| 168 | 🟢 docx_classic138_color_grid_table | ✅ | 1.0 | 0.9813 | 1/1 | 0.9925 |
| 169 | 🟢 docx_classic139_paragraph_shading_patterns | ✅ | 1.0 | 0.8805 | 1/1 | 0.9522 |
| 170 | 🟢 docx_classic140_rotated_text_table | ✅ | 0.8158 | 0.9742 | 1/1 | 0.916 |
| 171 | 🟢 docx_classic141_mixed_border_styles | ✅ | 1.0 | 0.9903 | 1/1 | 0.9961 |
| 172 | 🟢 docx_classic142_styled_invoice_document | ✅ | 1.0 | 0.9306 | 1/1 | 0.9722 |
| 173 | 🟢 docx_classic143_multi_section_document | ✅ | 1.0 | 0.8782 | 1/1 | 0.9513 |
| 174 | 🟢 docx_classic144_note_style_paragraphs | ✅ | 1.0 | 0.986 | 1/1 | 0.9944 |
| 175 | 🟢 docx_classic145_status_badge_table | ✅ | 1.0 | 0.9703 | 1/1 | 0.9881 |
| 176 | 🟢 docx_classic146_double_border_table | ✅ | 1.0 | 0.9805 | 1/1 | 0.9922 |
| 177 | 🟢 docx_classic147_multi_section_styled_report | ✅ | 1.0 | 0.9299 | 1/1 | 0.972 |
| 178 | 🟢 docx_classic148_data_grid_document | ✅ | 1.0 | 0.894 | 1/1 | 0.9576 |
| 179 | 🟢 docx_classic149_merged_section_report | ✅ | 1.0 | 0.9226 | 1/1 | 0.969 |
| 180 | 🟢 docx_classic150_kitchen_sink_styles | ✅ | 1.0 | 0.9559 | 2/2 | 0.9824 |
Average Overall Score: 0.9765
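The report does not state how the Overall column is derived. The values in the Summary table are consistent with a weighted sum of 0.4 × Text Sim + 0.4 × Visual Avg + 0.2 × a page-count score, where the page-count score is 1.0 when the MiniPdf and reference page counts match and 0.5 otherwise. This is an inference from the table, not taken from the benchmark's source code. The function names and weights below are assumptions made for illustration:

```python
# Sketch of a scoring rule that reproduces sample rows of the Summary table.
# ASSUMPTION: the 0.4 / 0.4 / 0.2 weights and the 0.5 page-mismatch score are
# inferred from the published numbers, not read from the benchmark's code.

def page_score(minipdf_pages: int, reference_pages: int) -> float:
    """Hypothetical page-count component: 1.0 on a match, 0.5 otherwise."""
    return 1.0 if minipdf_pages == reference_pages else 0.5


def overall_score(text_sim: float, visual_avg: float,
                  minipdf_pages: int, reference_pages: int) -> float:
    """Weighted sum of the three components, rounded to four decimals."""
    score = (0.4 * text_sim
             + 0.4 * visual_avg
             + 0.2 * page_score(minipdf_pages, reference_pages))
    return round(score, 4)


# Sample rows copied from the Summary table:
# (test case, text sim, visual avg, MiniPdf pages, reference pages, reported overall)
SAMPLE_ROWS = [
    ("docx_classic01_single_paragraph", 1.0, 0.997, 1, 1, 0.9988),
    ("docx_classic13_long_document", 1.0, 0.6873, 3, 4, 0.7749),
    ("docx_classic34_employee_directory_with_photo", 1.0, 0.4892, 1, 2, 0.6957),
    ("docx_classic35_tab_stops", 0.64, 0.9903, 1, 1, 0.8521),
    ("docx_classic50_long_table_with_formatting", 0.7465, 0.634, 2, 2, 0.7522),
]

for name, text_sim, visual_avg, m_pages, r_pages, reported in SAMPLE_ROWS:
    computed = overall_score(text_sim, visual_avg, m_pages, r_pages)
    print(f"{name}: computed={computed} reported={reported}")
```

Under this reading, a page-count mismatch costs a fixed 0.1 of the overall score regardless of how many pages differ, which would explain why the three cases with mismatched page counts (3/4, 1/2, 6/7) sit at the bottom of the table mainly because of their low visual averages.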
Visual Comparison
[Side-by-side page renders for each test case, MiniPdf on the left and LibreOffice (Reference) on the right, one image pair per page; images omitted. Each test case is labelled with its overall score as a percentage, matching the Overall column in the Summary table.]

Three test cases have a reference page with no MiniPdf counterpart (shown as "missing" in the MiniPdf column):

| Test Case | Pages (M/R) | Overall |
|---|---|---|
| docx_classic13_long_document | 3/4 | 77.5% |
| docx_classic34_employee_directory_with_photo | 1/2 | 69.6% |
| docx_classic58_dense_paragraph_document | 6/7 | 80.9% |
Detailed Results
docx_classic01_single_paragraph
- Text Similarity: 1.0
- Visual Average: 0.997
- Overall Score: 0.9988
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=260100 bytes, Reference=48037 bytes
Text content: ✅ Identical
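
The report does not state how the Overall Score is derived. The figures listed in these results are, however, consistent with a weighted sum of 0.4 × Text Similarity, 0.4 × Visual Average, and 0.2 × a page-count term. The sketch below reproduces them under that assumption. The weights, the 0.5 page term for mismatched page counts, and the function name `overall_score` are all inferred from the numbers, not taken from the MiniPdf tooling.

```python
def overall_score(text_sim: float, visual_avg: float,
                  pages_minipdf: int, pages_reference: int) -> float:
    """Weighted sum inferred from this report's figures (an assumption)."""
    # Page term: 1.0 when the page counts match. A value of 0.5 fits the
    # two mismatches listed here (3 vs 4 pages and 1 vs 2 pages); how a
    # larger difference is scored cannot be inferred from this data.
    page_term = 1.0 if pages_minipdf == pages_reference else 0.5
    return round(0.4 * text_sim + 0.4 * visual_avg + 0.2 * page_term, 4)


# docx_classic01_single_paragraph: 1.0, 0.997, 1/1
print(overall_score(1.0, 0.997, 1, 1))
# docx_classic13_long_document: 1.0, 0.6873, 3/4
print(overall_score(1.0, 0.6873, 3, 4))
```

Read this way, a page-count mismatch costs 0.1 directly and usually more through the Visual Average, which is why docx_classic13_long_document scores 0.7749 despite identical text.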
docx_classic02_multiple_paragraphs
- Text Similarity: 1.0
- Visual Average: 0.9831
- Overall Score: 0.9932
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=262290 bytes, Reference=48946 bytes
Text Diff
--- minipdf/docx_classic02_multiple_paragraphs.pdf
+++ reference/docx_classic02_multiple_paragraphs.pdf
@@ -1,10 +1,10 @@
-This is paragraph 1. It contains some sample text to test how MiniPdf handles multiple consecutive
-paragraphs with default spacing.
-This is paragraph 2. It contains some sample text to test how MiniPdf handles multiple consecutive
-paragraphs with default spacing.
-This is paragraph 3. It contains some sample text to test how MiniPdf handles multiple consecutive
-paragraphs with default spacing.
-This is paragraph 4. It contains some sample text to test how MiniPdf handles multiple consecutive
-paragraphs with default spacing.
-This is paragraph 5. It contains some sample text to test how MiniPdf handles multiple consecutive
-paragraphs with default spacing.
+This is paragraph 1. It contains some sample text to test how MiniPdf handles multiple
+consecutive paragraphs with default spacing.
+This is paragraph 2. It contains some sample text to test how MiniPdf handles multiple
+consecutive paragraphs with default spacing.
+This is paragraph 3. It contains some sample text to test how MiniPdf handles multiple
+consecutive paragraphs with default spacing.
+This is paragraph 4. It contains some sample text to test how MiniPdf handles multiple
+consecutive paragraphs with default spacing.
+This is paragraph 5. It contains some sample text to test how MiniPdf handles multiple
+consecutive paragraphs with default spacing.
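
This entry pairs a Text Similarity of 1.0 with a non-empty Text Diff. One reading is that the similarity is computed over the words regardless of where lines wrap, while the diff is line-based. The sketch below shows that behaviour with Python's standard `difflib`. It is an illustration under that assumption, not the report generator's actual code, and `text_similarity` and `text_diff` are hypothetical names.

```python
import difflib


def text_similarity(minipdf_text: str, reference_text: str) -> float:
    """Word-level similarity, so line-wrap positions do not matter."""
    matcher = difflib.SequenceMatcher(
        None, minipdf_text.split(), reference_text.split(), autojunk=False
    )
    return matcher.ratio()


def text_diff(minipdf_text: str, reference_text: str, name: str) -> str:
    """Line-based unified diff in the same layout as the report's diffs."""
    lines = difflib.unified_diff(
        minipdf_text.splitlines(),
        reference_text.splitlines(),
        fromfile=f"minipdf/{name}.pdf",
        tofile=f"reference/{name}.pdf",
        lineterm="",
    )
    return "\n".join(lines)


minipdf = ("This is paragraph 1. It contains some sample text to test how "
           "MiniPdf handles multiple consecutive\n"
           "paragraphs with default spacing.")
reference = ("This is paragraph 1. It contains some sample text to test how "
             "MiniPdf handles multiple\n"
             "consecutive paragraphs with default spacing.")

print(text_similarity(minipdf, reference))
print(text_diff(minipdf, reference, "docx_classic02_multiple_paragraphs"))
```

Under this reading, a diff with Text Similarity 1.0 signals a difference in line width or font metrics rather than missing or altered text.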
docx_classic03_headings
- Text Similarity: 1.0
- Visual Average: 0.9916
- Overall Score: 0.9966
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1316211 bytes, Reference=95865 bytes
Text content: ✅ Identical
docx_classic04_bold_italic
- Text Similarity: 1.0
- Visual Average: 0.9982
- Overall Score: 0.9993
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=700697 bytes, Reference=94134 bytes
Text content: ✅ Identical
docx_classic05_font_sizes
- Text Similarity: 1.0
- Visual Average: 0.9902
- Overall Score: 0.9961
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=256613 bytes, Reference=43230 bytes
Text content: ✅ Identical
docx_classic06_font_colors
- Text Similarity: 1.0
- Visual Average: 0.9991
- Overall Score: 0.9996
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=256761 bytes, Reference=41050 bytes
Text content: ✅ Identical
docx_classic07_alignment
- Text Similarity: 1.0
- Visual Average: 0.9887
- Overall Score: 0.9955
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=259611 bytes, Reference=43210 bytes
Text Diff
--- minipdf/docx_classic07_alignment.pdf
+++ reference/docx_classic07_alignment.pdf
@@ -1,8 +1,8 @@
-Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut
-labore et dolore magna aliqua.
-Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut
-labore et dolore magna aliqua.
-Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut
-labore et dolore magna aliqua.
-Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut
-labore et dolore magna aliqua.
+Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt
+ut labore et dolore magna aliqua.
+Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt
+ut labore et dolore magna aliqua.
+Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt
+ut labore et dolore magna aliqua.
+Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt
+ut labore et dolore magna aliqua.
docx_classic08_bullet_list
- Text Similarity: 1.0
- Visual Average: 0.9967
- Overall Score: 0.9987
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=876248 bytes, Reference=92213 bytes
Text content: ✅ Identical
docx_classic09_numbered_list
- Text Similarity: 1.0
- Visual Average: 0.9944
- Overall Score: 0.9978
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=878199 bytes, Reference=91983 bytes
Text content: ✅ Identical
docx_classic10_simple_table
- Text Similarity: 1.0
- Visual Average: 0.9905
- Overall Score: 0.9962
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883731 bytes, Reference=73744 bytes
Text content: ✅ Identical
docx_classic11_table_with_shading
- Text Similarity: 1.0
- Visual Average: 0.9797
- Overall Score: 0.9919
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=886937 bytes, Reference=91111 bytes
Text content: ✅ Identical
docx_classic12_merged_cells_table
- Text Similarity: 1.0
- Visual Average: 0.9919
- Overall Score: 0.9968
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=880703 bytes, Reference=71750 bytes
Text content: ✅ Identical
docx_classic13_long_document
- Text Similarity: 1.0
- Visual Average: 0.6873
- Overall Score: 0.7749
- Pages: MiniPdf=3, Reference=4
- File Size: MiniPdf=931425 bytes, Reference=102517 bytes
Text Diff
--- minipdf/docx_classic13_long_document.pdf
+++ reference/docx_classic13_long_document.pdf
@@ -1,109 +1,125 @@
Project Report
This document is designed to span multiple pages to test pagination in MiniPdf.
Section 1
-This is section 1 of the report. It contains detailed analysis of the topic at hand. The quick brown
-fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly quick daft
-zebras jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge my vow.This is
-section 1 of the report. It contains detailed analysis of the topic at hand. The quick brown fox
-jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly quick daft zebras
-jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge my vow.
+This is section 1 of the report. It contains detailed analysis of the topic at hand. The quick
+brown fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly
+quick daft zebras jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge
+my vow.This is section 1 of the report. It contains detailed analysis of the topic at hand. The
+quick brown fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How
+vexingly quick daft zebras jump. The five boxing wizards jump quickly. Sphinx of black
+quartz, judge my vow.
Section 2
-This is section 2 of the report. It contains detailed analysis of the topic at hand. The quick brown
-fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly quick daft
-zebras jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge my vow.This is
-section 2 of the report. It contains detailed analysis of the topic at hand. The quick brown fox
-jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly quick daft zebras
-jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge my vow.
+This is section 2 of the report. It contains detailed analysis of the topic at hand. The quick
+brown fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly
+quick daft zebras jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge
+my vow.This is section 2 of the report. It contains detailed analysis of the topic at hand. The
+quick brown fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How
+vexingly quick daft zebras jump. The five boxing wizards jump quickly. Sphinx of black
+quartz, judge my vow.
Section 3
-This is section 3 of the report. It contains detailed analysis of the topic at hand. The quick brown
-fox jumps over the lazy dog. Pack my box with five dozen liquor jugs. How vexingly quick daft
-zebras jump. The five boxing wizards jump quickly. Sphinx of black quartz, judge my vow.This is
-section 3 of the report. It contains detailed analysis of the topic at hand. The quick brown fox
-jumps over the lazy dog. Pack my box
... (14909 more characters)
docx_classic14_mixed_content
- Text Similarity: 1.0
- Visual Average: 0.9761
- Overall Score: 0.9904
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=893887 bytes, Reference=115112 bytes
Text content: ✅ Identical
docx_classic15_indentation
- Text Similarity: 1.0
- Visual Average: 0.983
- Overall Score: 0.9932
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=880189 bytes, Reference=74243 bytes
Text Diff
--- minipdf/docx_classic15_indentation.pdf
+++ reference/docx_classic15_indentation.pdf
@@ -3,6 +3,7 @@
This paragraph is indented by 36 points from the left margin.
This paragraph is indented by 72 points from the left margin.
This paragraph is indented by 108 points from the left margin.
-This paragraph is indented by 144 points from the left margin.
-This paragraph has a first-line indent of 36 points. The remaining lines wrap normally back
-to the left margin.
+This paragraph is indented by 144 points from the left
+margin.
+This paragraph has a first-line indent of 36 points. The remaining lines wrap
+normally back to the left margin.
docx_classic16_line_spacing
- Text Similarity: 1.0
- Visual Average: 0.9826
- Overall Score: 0.993
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883147 bytes, Reference=78004 bytes
Text content: ✅ Identical
docx_classic17_page_break
- Text Similarity: 1.0
- Visual Average: 0.9978
- Overall Score: 0.9991
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=876682 bytes, Reference=69855 bytes
Text content: ✅ Identical
docx_classic18_embedded_image
- Text Similarity: 1.0
- Visual Average: 0.9934
- Overall Score: 0.9974
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=876988 bytes, Reference=67381 bytes
Text content: ✅ Identical
docx_classic19_multiple_images
- Text Similarity: 1.0
- Visual Average: 0.9885
- Overall Score: 0.9954
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=878005 bytes, Reference=66233 bytes
Text content: ✅ Identical
docx_classic20_table_with_many_rows
- Text Similarity: 1.0
- Visual Average: 0.9652
- Overall Score: 0.9861
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=915751 bytes, Reference=95029 bytes
Text content: ✅ Identical
docx_classic21_nested_lists
- Text Similarity: 1.0
- Visual Average: 0.9932
- Overall Score: 0.9973
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=879647 bytes, Reference=98694 bytes
Text content: ✅ Identical
docx_classic22_horizontal_rule
- Text Similarity: 1.0
- Visual Average: 0.9929
- Overall Score: 0.9972
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=878545 bytes, Reference=70951 bytes
Text content: ✅ Identical
docx_classic23_mixed_formatting_runs
- Text Similarity: 1.0
- Visual Average: 0.9968
- Overall Score: 0.9987
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1105897 bytes, Reference=100389 bytes
Text content: ✅ Identical
docx_classic24_two_column_table_layout
- Text Similarity: 0.7907
- Visual Average: 0.9924
- Overall Score: 0.9132
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=880422 bytes, Reference=72170 bytes
Text Diff
--- minipdf/docx_classic24_two_column_table_layout.pdf
+++ reference/docx_classic24_two_column_table_layout.pdf
@@ -1,4 +1,5 @@
Two-Column Layout
-Left column content. This is the first column of Right column content. This is the second
-a two-column layout. It demonstrates how column. Both columns should render side-by-
-tables can be used for text layout purposes. side in the PDF output.
+Left column content. This is the first Right column content. This is the second
+column of a two-column layout. It column. Both columns should render side-
+demonstrates how tables can be used for by-side in the PDF output.
+text layout purposes.
docx_classic25_title_and_subtitle
- Text Similarity: 1.0
- Visual Average: 0.9938
- Overall Score: 0.9975
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=1111626 bytes, Reference=134268 bytes
Text content: ✅ Identical
docx_classic26_table_alignment
- Text Similarity: 1.0
- Visual Average: 0.9914
- Overall Score: 0.9966
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883658 bytes, Reference=94051 bytes
Text content: ✅ Identical
docx_classic27_long_paragraph_wrapping
- Text Similarity: 1.0
- Visual Average: 0.9358
- Overall Score: 0.9743
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=889346 bytes, Reference=72456 bytes
Text Diff
--- minipdf/docx_classic27_long_paragraph_wrapping.pdf
+++ reference/docx_classic27_long_paragraph_wrapping.pdf
@@ -1,23 +1,25 @@
Word Wrapping Test
-This is a very long paragraph designed to test how MiniPdf handles word wrapping across line
-boundaries. The text should flow naturally from one line to the next without any awkward breaks or
-overflow. This is a very long paragraph designed to test how MiniPdf handles word wrapping
-across line boundaries. The text should flow naturally from one line to the next without any
-awkward breaks or overflow. This is a very long paragraph designed to test how MiniPdf handles
-word wrapping across line boundaries. The text should flow naturally from one line to the next
-without any awkward breaks or overflow. This is a very long paragraph designed to test how
-MiniPdf handles word wrapping across line boundaries. The text should flow naturally from one
-line to the next without any awkward breaks or overflow. This is a very long paragraph designed to
-test how MiniPdf handles word wrapping across line boundaries. The text should flow naturally
-from one line to the next without any awkward breaks or overflow. This is a very long paragraph
-designed to test how MiniPdf handles word wrapping across line boundaries. The text should flow
-naturally from one line to the next without any awkward breaks or overflow. This is a very long
-paragraph designed to test how MiniPdf handles word wrapping across line boundaries. The text
-should flow naturally from one line to the next without any awkward breaks or overflow. This is a
-very long paragraph designed to test how MiniPdf handles word wrapping across line boundaries.
-The text should flow naturally from one line to the next without any awkward breaks or overflow.
-This is a very long paragraph designed to test how MiniPdf handles word wrapping across line
-boundaries. The text should flow naturally from one line to the next without any awkward breaks or
-overflow. This is a very long paragraph designed to test how MiniPdf handles word wrapping
-across line boundaries. The text should flow naturally from one line to the next without any
+This is a very long paragraph designed to test how MiniPdf handles word wrapping across
+line boundaries. The text should flow naturally from one line to the next without any
+awkward breaks or overflow. This is a very long paragraph designed to test how MiniPdf
+handles word wrapping across line boundaries. The text should flow naturally from one line
+to the next without any awkward breaks or overflow. This is a very long paragraph
+designed to test how MiniPdf handles word wrapping across line boundaries. The text
+should flow naturally from one line to the next without any awkward breaks or overflow.
+This is a very long paragraph designed to test how MiniPdf handles word wrapping across
+line boundaries. The text should flow naturally from one line to the next without any
+awkward breaks o
... (1251 more characters)
docx_classic28_special_characters
- Text Similarity: 1.0
- Visual Average: 0.9927
- Overall Score: 0.9971
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=884595 bytes, Reference=80991 bytes
Text content: ✅ Identical
docx_classic29_table_with_image
- Text Similarity: 0.9728
- Visual Average: 0.9899
- Overall Score: 0.9851
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=879641 bytes, Reference=70208 bytes
Text Diff
--- minipdf/docx_classic29_table_with_image.pdf
+++ reference/docx_classic29_table_with_image.pdf
@@ -1,5 +1,5 @@
Product Card
Product Description
-MiniPdf Widget - A compact, efficient tool for
-PDF conversion. Lightweight and dependency-
-free.
+MiniPdf Widget - A compact, efficient tool
+for PDF conversion. Lightweight and
+dependency-free.
docx_classic30_comprehensive_report
- Text Similarity: 1.0
- Visual Average: 0.7788
- Overall Score: 0.9115
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=1137160 bytes, Reference=220886 bytes
Text Diff
--- minipdf/docx_classic30_comprehensive_report.pdf
+++ reference/docx_classic30_comprehensive_report.pdf
@@ -8,9 +8,9 @@
4. Financial Overview
5. Recommendations
1. Executive Summary
-This report provides a comprehensive analysis of the technology landscape in 2026. Key findings
-include continued growth in AI adoption, increased focus on sustainability, and emerging trends in
-quantum computing.
+This report provides a comprehensive analysis of the technology landscape in 2026. Key
+findings include continued growth in AI adoption, increased focus on sustainability, and
+emerging trends in quantum computing.
2. Market Analysis
The following table summarizes market share across key sectors:
Sector Market Share Growth
docx_classic31_product_card_with_image
- Text Similarity: 1.0
- Visual Average: 0.9776
- Overall Score: 0.991
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885758 bytes, Reference=108260 bytes
Text content: ✅ Identical
docx_classic31_strikethrough_text
- Text Similarity: 1.0
- Visual Average: 0.9917
- Overall Score: 0.9967
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=881093 bytes, Reference=73919 bytes
Text content: ✅ Identical
docx_classic32_company_logo_header
- Text Similarity: 1.0
- Visual Average: 0.9546
- Overall Score: 0.9818
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=893070 bytes, Reference=112769 bytes
Text Diff
--- minipdf/docx_classic32_company_logo_header.pdf
+++ reference/docx_classic32_company_logo_header.pdf
@@ -1,8 +1,8 @@
ACME CORPORATION
123 Business Ave, Suite 500 | New York, NY 10001
Quarterly Report Q1 2026
-This report summarizes the activities and financial performance of ACME Corporation during the
-first quarter of 2026.
+This report summarizes the activities and financial performance of ACME Corporation
+during the first quarter of 2026.
Metric Target Actual
Revenue \$5M \$5.3M
Customers 500 520
docx_classic32_superscript_subscript
- Text Similarity: 0.9762
- Visual Average: 0.9958
- Overall Score: 0.9888
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=881354 bytes, Reference=73303 bytes
Text Diff
--- minipdf/docx_classic32_superscript_subscript.pdf
+++ reference/docx_classic32_superscript_subscript.pdf
@@ -1,10 +1,8 @@
Superscript and Subscript
2
Einstein's equation: E = mc
-Water: H O
-2
-Carbon dioxide: CO
-2
+Water: H 2 O
+Carbon dioxide: CO 2
1
Footnote reference
n n n
docx_classic33_highlighted_text
- Text Similarity: 1.0
- Visual Average: 0.9464
- Overall Score: 0.9786
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=876898 bytes, Reference=67420 bytes
Text content: ✅ Identical
docx_classic33_two_products_side_by_side
- Text Similarity: 1.0
- Visual Average: 0.9794
- Overall Score: 0.9918
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=895520 bytes, Reference=107718 bytes
Text content: ✅ Identical
docx_classic34_employee_directory_with_photo
- Text Similarity: 1.0
- Visual Average: 0.4892
- Overall Score: 0.6957
- Pages: MiniPdf=1, Reference=2
- File Size: MiniPdf=887688 bytes, Reference=99625 bytes
Text Diff
--- minipdf/docx_classic34_employee_directory_with_photo.pdf
+++ reference/docx_classic34_employee_directory_with_photo.pdf
@@ -10,4 +10,5 @@
Carol Williams
UX Designer
Email: carol@company.com
+---PAGE---
Department: Design
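
The `+---PAGE---` line in this diff appears to be a page separator in the extracted text: the reference PDF breaks to a second page before "Department: Design", while the MiniPdf output keeps everything on one page. A minimal sketch of how per-page text could be joined so that page breaks show up as diff lines follows. The marker string is copied from the report, and the helpers `join_pages` and `page_count` are hypothetical.

```python
PAGE_MARKER = "---PAGE---"  # marker string as it appears in this report's diffs


def join_pages(pages: list[str]) -> str:
    """Join per-page text with a marker line between consecutive pages."""
    return f"\n{PAGE_MARKER}\n".join(page.strip("\n") for page in pages)


def page_count(joined: str) -> int:
    """Recover the number of pages from text produced by join_pages."""
    return joined.splitlines().count(PAGE_MARKER) + 1


reference_text = join_pages(["Email: carol@company.com", "Department: Design"])
minipdf_text = join_pages(["Email: carol@company.com\nDepartment: Design"])
print(page_count(minipdf_text), page_count(reference_text))
```

With a separator like this, a pagination difference surfaces as a single added or removed marker line even when the words themselves match.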
docx_classic34_paragraph_borders
- Text Similarity: 1.0
- Visual Average: 0.9809
- Overall Score: 0.9924
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=880554 bytes, Reference=69819 bytes
Text content: ✅ Identical
docx_classic35_inventory_with_product_photos
- Text Similarity: 1.0
- Visual Average: 0.9783
- Overall Score: 0.9913
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=894856 bytes, Reference=84431 bytes
Text content: ✅ Identical
docx_classic35_tab_stops
- Text Similarity: 0.64
- Visual Average: 0.9903
- Overall Score: 0.8521
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=884855 bytes, Reference=76285 bytes
Text Diff
--- minipdf/docx_classic35_tab_stops.pdf
+++ reference/docx_classic35_tab_stops.pdf
@@ -1,6 +1,6 @@
Tab Stop Alignment
-Chapter 1: Introduction.............................................................................................................................................................1
-Chapter 2: Getting Started......................................................................................................................................................5
-Chapter 3: Advanced Topics..............................................................................................................................................15
-Chapter 4: Best Practices.....................................................................................................................................................28
-Chapter 5: Conclusion............................................................................................................................................................35
+Chapter 1: Introduction ......................................................................................................................................................... 1
+Chapter 2: Getting Started..................................................................................................................................................... 5
+Chapter 3: Advanced Topics.............................................................................................................................................. 15
+Chapter 4: Best Practices .................................................................................................................................................... 28
+Chapter 5: Conclusion .......................................................................................................................................................... 35
docx_classic36_invoice_with_logo
- Text Similarity: 1.0
- Visual Average: 0.964
- Overall Score: 0.9856
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=272854 bytes, Reference=86303 bytes
Text content: ✅ Identical
docx_classic36_wide_table
- Text Similarity: 1.0
- Visual Average: 0.979
- Overall Score: 0.9916
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897665 bytes, Reference=100125 bytes
Text content: ✅ Identical
docx_classic37_nested_table
- Text Similarity: 0.963
- Visual Average: 0.9913
- Overall Score: 0.9817
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=882411 bytes, Reference=76696 bytes
Text Diff
--- minipdf/docx_classic37_nested_table.pdf
+++ reference/docx_classic37_nested_table.pdf
@@ -1,6 +1,6 @@
Nested Table Layout
Section A Section B
-Details: This cell contains plain text while the adjacent
-Item Qty cell has a nested table.
+Details: This cell contains plain text while the
+Item Qty adjacent cell has a nested table.
Widget 10
Gadget 5
docx_classic37_real_estate_listing
- Text Similarity: 1.0
- Visual Average: 0.9754
- Overall Score: 0.9902
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=887162 bytes, Reference=106313 bytes
Text content: ✅ Identical
docx_classic38_restaurant_menu_with_photos
- Text Similarity: 1.0
- Visual Average: 0.9748
- Overall Score: 0.9899
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=889593 bytes, Reference=104667 bytes
Text Diff
--- minipdf/docx_classic38_restaurant_menu_with_photos.pdf
+++ reference/docx_classic38_restaurant_menu_with_photos.pdf
@@ -1,12 +1,12 @@
The Garden Bistro
Fresh ingredients, timeless flavors
Appetizers
-Bruschetta \$8
-Caesar Salad \$10
+Bruschetta \$8
+Caesar Salad \$10
Main Courses
-Grilled Salmon \$22
-Filet Mignon \$32
+Grilled Salmon \$22
+Filet Mignon \$32
---PAGE---
Desserts
-Tiramisu \$10
-Chocolate Lava Cake \$12
+Tiramisu \$10
+Chocolate Lava Cake \$12
docx_classic38_table_column_widths
- Text Similarity: 0.8485
- Visual Average: 0.9821
- Overall Score: 0.9322
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=887917 bytes, Reference=102328 bytes
Text Diff
--- minipdf/docx_classic38_table_column_widths.pdf
+++ reference/docx_classic38_table_column_widths.pdf
@@ -1,10 +1,6 @@
Custom Column Widths
# Description Category Amount
-1 Office supplies and Operations \$245.00
-stationery
-2 Cloud hosting Technology \$1,200.00
-monthly fee
-3 Team lunch and Meals \$380.00
-catering
-4 Conference Travel \$599.00
-registration
+1 Office supplies and stationery Operations \$245.00
+2 Cloud hosting monthly fee Technology \$1,200.00
+3 Team lunch and catering Meals \$380.00
+4 Conference registration Travel \$599.00
docx_classic39_cover_page_with_image
- Text Similarity: 1.0
- Visual Average: 0.9866
- Overall Score: 0.9946
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=1112245 bytes, Reference=120726 bytes
Text content: ✅ Identical
docx_classic39_financial_report
- Text Similarity: 1.0
- Visual Average: 0.9783
- Overall Score: 0.9913
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=896442 bytes, Reference=111342 bytes
Text content: ✅ Identical
docx_classic40_product_catalog_with_images
- Text Similarity: 1.0
- Visual Average: 0.9729
- Overall Score: 0.9892
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=894996 bytes, Reference=117045 bytes
Text content: ✅ Identical
docx_classic40_resume
- Text Similarity: 1.0
- Visual Average: 0.9745
- Overall Score: 0.9898
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1134840 bytes, Reference=197726 bytes
Text Diff
--- minipdf/docx_classic40_resume.pdf
+++ reference/docx_classic40_resume.pdf
@@ -1,8 +1,8 @@
JOHN DOE
john.doe@email.com | +1-555-0100 | New York, NY
Professional Summary
-Experienced software engineer with 8+ years of expertise in building scalable web applications
-and distributed systems. Proficient in C#, Python, and JavaScript.
+Experienced software engineer with 8+ years of expertise in building scalable web
+applications and distributed systems. Proficient in C#, Python, and JavaScript.
Experience
Senior Software Engineer - Tech Corp Inc.
January 2020 - Present
docx_classic41_business_letter
- Text Similarity: 0.9893
- Visual Average: 0.9866
- Overall Score: 0.9904
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=267836 bytes, Reference=80009 bytes
Text Diff
--- minipdf/docx_classic41_business_letter.pdf
+++ reference/docx_classic41_business_letter.pdf
@@ -7,11 +7,11 @@
456 Commerce St
San Francisco, CA 94102
Dear Mr. Wilson,
-Thank you for your interest in our products. We are pleased to inform you that your order #ORD-
-2026-0315 has been processed and is scheduled for delivery by March 15, 2026.
-Please find enclosed the detailed invoice and shipping confirmation. If you have any questions
-regarding your order, please do not hesitate to contact our customer service team at
-support@acme.com or call us at +1-555-0200.
+Thank you for your interest in our products. We are pleased to inform you that your order
+#ORD-2026-0315 has been processed and is scheduled for delivery by March 15, 2026.
+Please find enclosed the detailed invoice and shipping confirmation. If you have any
+questions regarding your order, please do not hesitate to contact our customer service team
+at support@acme.com or call us at +1-555-0200.
We value your business and look forward to a continued partnership.
Sincerely,
Sarah Johnson
docx_classic41_newsletter_with_hero_image
- Text Similarity: 1.0
- Visual Average: 0.9217
- Overall Score: 0.9687
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=894862 bytes, Reference=132792 bytes
Text Diff
--- minipdf/docx_classic41_newsletter_with_hero_image.pdf
+++ reference/docx_classic41_newsletter_with_hero_image.pdf
@@ -1,11 +1,11 @@
TECH WEEKLY
Issue #42 | March 2026
AI-Powered Code Review Tools
-New AI-powered code review tools are transforming how development teams collaborate. Early
-adopters report 30% faster review cycles and 25% fewer bugs reaching production.
+New AI-powered code review tools are transforming how development teams collaborate.
+Early adopters report 30% faster review cycles and 25% fewer bugs reaching production.
Cloud Migration Best Practices
-As organizations accelerate cloud adoption, a structured migration approach becomes critical. We
-explore five proven strategies for seamless transitions.
+As organizations accelerate cloud adoption, a structured migration approach becomes
+critical. We explore five proven strategies for seamless transitions.
• Assess workload compatibility
• Plan data migration carefully
• Implement security from day one
docx_classic42_chart_image_with_data
- Text Similarity: 1.0
- Visual Average: 0.9755
- Overall Score: 0.9902
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=900614 bytes, Reference=112851 bytes
Text content: ✅ Identical
docx_classic42_meeting_minutes
- Text Similarity: 0.9903
- Visual Average: 0.9703
- Overall Score: 0.9842
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=900227 bytes, Reference=112714 bytes
Text Diff
--- minipdf/docx_classic42_meeting_minutes.pdf
+++ reference/docx_classic42_meeting_minutes.pdf
@@ -9,14 +9,15 @@
3. New Hire Onboarding
4. Action Items
Discussion Summary
-Alice presented the Q4 results showing a 15% revenue increase. Bob proposed reallocating 10%
-of the marketing budget to R&D. Carol reported that three new engineering positions have been
-approved.
+Alice presented the Q4 results showing a 15% revenue increase. Bob proposed reallocating
+10% of the marketing budget to R&D. Carol reported that three new engineering positions
+have been approved.
Action Items
Owner Action Due Date
Bob Submit revised budget March 10
proposal
Carol Post job listings for March 7
engineering roles
-David Prepare onboarding materials March 14
+David Prepare onboarding March 14
+materials
Eve Schedule follow-up meeting March 5
docx_classic43_event_flyer_with_banner
- Text Similarity: 1.0
- Visual Average: 0.9195
- Overall Score: 0.9678
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=889637 bytes, Reference=109856 bytes
Text content: ✅ Identical
docx_classic43_invoice_document
- Text Similarity: 0.9775
- Visual Average: 0.949
- Overall Score: 0.9706
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=280556 bytes, Reference=99728 bytes
Text Diff
--- minipdf/docx_classic43_invoice_document.pdf
+++ reference/docx_classic43_invoice_document.pdf
@@ -7,14 +7,18 @@
Bill To: Widget Industries Ship To: Same as billing
Item Description Qty Unit Price Total
SVC-001 Consulting 40 \$150.00 \$6,000.00
-Services (40 hrs)
+Services (40
+hrs)
LIC-002 Enterprise 5 \$499.00 \$2,495.00
-License (Annual)
-HW-003 Server Hardware 2 \$2,499.00 \$4,998.00
+License
+(Annual)
+HW-003 Server 2 \$2,499.00 \$4,998.00
+Hardware
SUP-004 Premium 1 \$1,800.00 \$1,800.00
Support Plan
-TRN-005 On-site Training 1 \$3,000.00 \$3,000.00
-(2 days)
+TRN-005 On-site 1 \$3,000.00 \$3,000.00
+Training (2
+days)
Subtotal: \$18,293.00
Tax (8.25%): \$1,509.17
Total Due: \$19,802.17
docx_classic44_dashboard_with_kpi_image
- Text Similarity: 1.0
- Visual Average: 0.9745
- Overall Score: 0.9898
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=901757 bytes, Reference=116297 bytes
Text content: ✅ Identical
docx_classic44_memo
- Text Similarity: 1.0
- Visual Average: 0.9768
- Overall Score: 0.9907
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=887407 bytes, Reference=129775 bytes
Text Diff
--- minipdf/docx_classic44_memo.pdf
+++ reference/docx_classic44_memo.pdf
@@ -1,16 +1,16 @@
MEMORANDUM
-TO: All Department Heads
-FROM: Maria Garcia, CEO
-DATE: March 3, 2026
-RE: Quarterly Performance Review Process Changes
-Effective immediately, we are implementing several changes to our quarterly performance review
-process. These changes are designed to streamline evaluation procedures and provide more
-actionable feedback to team members.
+TO: All Department Heads
+FROM: Maria Garcia, CEO
+DATE: March 3, 2026
+RE: Quarterly Performance Review Process Changes
+Effective immediately, we are implementing several changes to our quarterly performance
+review process. These changes are designed to streamline evaluation procedures and
+provide more actionable feedback to team members.
Key Changes
• Reviews will now be conducted bi-monthly instead of quarterly
• Self-assessment forms must be submitted 5 business days before the review
• 360-degree feedback will be incorporated for all managerial positions
• New rating scale: 1-5 (replacing the current A-F system)
• All reviews must be completed within a 2-week window
-Please share this information with your teams and direct any questions to the HR department at
-hr@company.com.
+Please share this information with your teams and direct any questions to the HR
+department at hr@company.com.
docx_classic45_certificate_with_seal
- Text Similarity: 1.0
- Visual Average: 0.9778
- Overall Score: 0.9911
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=261857 bytes, Reference=71997 bytes
Text content: ✅ Identical
docx_classic45_project_plan
- Text Similarity: 1.0
- Visual Average: 0.9546
- Overall Score: 0.9818
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=914849 bytes, Reference=128979 bytes
Text Diff
--- minipdf/docx_classic45_project_plan.pdf
+++ reference/docx_classic45_project_plan.pdf
@@ -1,8 +1,8 @@
Project Plan: Website Redesign
Project Manager: Sarah Chen | Start Date: March 2026
Project Overview
-This project aims to redesign the company website to improve user experience, modernize the
-visual design, and optimize for mobile devices.
+This project aims to redesign the company website to improve user experience, modernize
+the visual design, and optimize for mobile devices.
Timeline
Phase Task Owner Start End
Discovery User research & UX Team Mar 1 Mar 14
docx_classic46_comparison_table
- Text Similarity: 1.0
- Visual Average: 0.9777
- Overall Score: 0.9911
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897171 bytes, Reference=107695 bytes
Text content: ✅ Identical
docx_classic46_product_image_grid
- Text Similarity: 1.0
- Visual Average: 0.9801
- Overall Score: 0.992
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=879439 bytes, Reference=83076 bytes
Text content: ✅ Identical
docx_classic47_data_dictionary
- Text Similarity: 1.0
- Visual Average: 0.9511
- Overall Score: 0.9804
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=907028 bytes, Reference=114030 bytes
Text Diff
--- minipdf/docx_classic47_data_dictionary.pdf
+++ reference/docx_classic47_data_dictionary.pdf
@@ -6,7 +6,8 @@
identifier
name VARCHAR(100) NOT NULL Full name
email VARCHAR(255) UNIQUE Email address
-created_at DATETIME DEFAULT NOW() Account creation date
+created_at DATETIME DEFAULT NOW() Account creation
+date
status ENUM DEFAULT 'active' Account status
Table: orders
Column Type Constraints Description
docx_classic47_news_article_with_hero_image
- Text Similarity: 1.0
- Visual Average: 0.9222
- Overall Score: 0.9689
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1119480 bytes, Reference=131080 bytes
Text Diff
--- minipdf/docx_classic47_news_article_with_hero_image.pdf
+++ reference/docx_classic47_news_article_with_hero_image.pdf
@@ -1,10 +1,10 @@
Open Source Software Adoption Hits Record High
By Sarah Mitchell | March 6, 2026 | Technology
-A new survey shows that 92% of enterprise organizations now use open source software in their
-technology stacks, up from 78% in 2024. The trend is driven by cost savings, community
-innovation, and increased security transparency.
-Key findings include a 45% increase in open source contributions from corporate developers, and
-growing adoption of open source AI frameworks across industries.
+A new survey shows that 92% of enterprise organizations now use open source software in
+their technology stacks, up from 78% in 2024. The trend is driven by cost savings,
+community innovation, and increased security transparency.
+Key findings include a 45% increase in open source contributions from corporate
+developers, and growing adoption of open source AI frameworks across industries.
Industry Reactions
"Open source has become the backbone of modern software development," said CTO James
Walker. "Organizations that embrace it gain a competitive advantage."
docx_classic48_multi_level_headings
- Text Similarity: 1.0
- Visual Average: 0.9766
- Overall Score: 0.9906
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=1348069 bytes, Reference=118651 bytes
Text Diff
--- minipdf/docx_classic48_multi_level_headings.pdf
+++ reference/docx_classic48_multi_level_headings.pdf
@@ -54,9 +54,9 @@
Implementation uses factory pattern with dependency injection.
3.1.2 Sub-component
Detailed description of sub-component 3.1.2.
----PAGE---
3.1.2.1 Implementation Notes
Implementation uses factory pattern with dependency injection.
+---PAGE---
3.2 Component 2
Component 3.2 handles specific operations within Module 3.
3.2.1 Sub-component
docx_classic48_task_list_with_status_icons
- Text Similarity: 1.0
- Visual Average: 0.9743
- Overall Score: 0.9897
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=894146 bytes, Reference=106209 bytes
Text content: ✅ Identical
docx_classic49_cjk_document
- Text Similarity: 0.9849
- Visual Average: 0.9733
- Overall Score: 0.9833
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=4617235 bytes, Reference=141092 bytes
Text Diff
--- minipdf/docx_classic49_cjk_document.pdf
+++ reference/docx_classic49_cjk_document.pdf
@@ -1,16 +1,18 @@
CJK Text Sample
Chinese (Simplified)
-MiniPdf 是一个轻量级的 .NET 库,用于将 Office 文档转换为 PDF 格式。
-它不依赖 Microsoft Office,可以在任何平台上运行。
+MiniPdf 是一个 轻 量 级 的 .NET 库 ,用于将 Office 文档 转换为 PDF 格式。
+它不依 赖 Microsoft Office ,可以在任何平台上运行。
Chinese (Traditional)
MiniPdf 是一個輕量級的 .NET 庫,用於將 Office 文檔轉換為 PDF 格式。
Japanese
-MiniPdf は軽量な .NET ライブラリで、Office ドキュメントを PDF に変換します。
+MiniPdf は軽量な .NET ライブラリで、 Office ドキュメントを PDF に変換します。
Korean
MiniPdf 는 Office 문서를 PDF 로 변환하는 경량 .NET 라이브러리입니다 .
Mixed CJK Table
Language Hello Thank You
-Chinese 你好 谢谢
+Chinese 你好
+谢谢
Japanese こんにちは ありがとう
-Korean 안녕하세요 감사합니다
+Korean
+안녕하세요 감사합니다
English Hello Thank you
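The Chinese and Japanese lines in the diff above seem to differ only in where the text extractor placed spaces between glyph runs. One way to check this is to compare the two strings with all whitespace removed, shown here for the first Simplified Chinese line copied from the diff. The helper name `strip_whitespace` is hypothetical, and this is not necessarily how the report's Text Similarity is computed.

```python
def strip_whitespace(text):
    """Remove every whitespace character, including spaces between CJK runs."""
    return "".join(text.split())


# First Simplified Chinese line, as extracted from each PDF.
minipdf_line = "MiniPdf 是一个轻量级的 .NET 库,用于将 Office 文档转换为 PDF 格式。"
reference_line = "MiniPdf 是一个 轻 量 级 的 .NET 库 ,用于将 Office 文档 转换为 PDF 格式。"

differs_as_extracted = minipdf_line != reference_line
same_without_whitespace = (
    strip_whitespace(minipdf_line) == strip_whitespace(reference_line)
)

print(differs_as_extracted, same_without_whitespace)
```

If both flags are true, the mismatch on that line comes from spacing in the extracted text layer rather than from missing or substituted characters.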
docx_classic49_wide_panoramic_banner
- Text Similarity: 1.0
- Visual Average: 0.8986
- Overall Score: 0.9594
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=888285 bytes, Reference=108687 bytes
Text content: ✅ Identical
docx_classic50_long_table_with_formatting
- Text Similarity: 0.7465
- Visual Average: 0.634
- Overall Score: 0.7522
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=949051 bytes, Reference=122324 bytes
Text Diff
--- minipdf/docx_classic50_long_table_with_formatting.pdf
+++ reference/docx_classic50_long_table_with_formatting.pdf
@@ -1,45 +1,63 @@
Server Inventory Report
Server ID Hostname IP Address OS Status
-SRV-001 server-001.local 10.0.0.1 Ubuntu 22.04 Online
-SRV-002 server-002.local 10.0.0.2 Windows Server Online
-2022
-SRV-003 server-003.local 10.0.0.3 RHEL 9 Online
-SRV-004 server-004.local 10.0.0.4 Debian 12 Maintenance
-SRV-005 server-005.local 10.0.0.5 CentOS Stream Online
-9
-SRV-006 server-006.local 10.0.0.6 Ubuntu 22.04 Online
-SRV-007 server-007.local 10.0.0.7 Windows Server Offline
-2022
-SRV-008 server-008.local 10.0.0.8 RHEL 9 Online
-SRV-009 server-009.local 10.0.0.9 Debian 12 Online
-SRV-010 server-010.local 10.0.0.10 CentOS Stream Online
-9
-SRV-011 server-011.local 10.0.0.11 Ubuntu 22.04 Online
-SRV-012 server-012.local 10.0.0.12 Windows Server Online
-2022
-SRV-013 server-013.local 10.0.0.13 RHEL 9 Online
-SRV-014 server-014.local 10.0.0.14 Debian 12 Maintenance
-SRV-015 server-015.local 10.0.0.15 CentOS Stream Online
-9
-SRV-016 server-016.local 10.0.0.16 Ubuntu 22.04 Online
-SRV-017 server-017.local 10.0.0.17 Windows Server Offline
-2022
-SRV-018 server-018.local 10.0.0.18 RHEL 9 Online
-SRV-019 server-019.local 10.0.0.19 Debian 12 Online
-SRV-020 server-020.local 10.0.0.20 CentOS Stream Online
-9
-SRV-021 server-021.local 10.0.0.21 Ubuntu 22.04 Online
-SRV-022 server-022.local 10.0.0.22 Windows Server Online
-2022
-SRV-023 server-023.local 10.0.0.23 RHEL 9 Online
-SRV-024 server-024.local 10.0.0.24 Debian 12 Maintenance
-SRV-025 server-025.local 10.0.0.25 CentOS Stream Online
-9
-SRV-026 server-026.local 10.0.0.26 Ubuntu 22.04 Online
-SRV-027 server-027.local 10.0.0.27 Windows Server Offline
-2022
-SRV-028 server-028.local 10.0.0.28 RHEL 9 Online
-SRV-029 server-029.local 10.0.0.29 Debian 12 Online
+SRV-001 server- 10.0.0.1 Ubuntu 22.04 Online
+001.local
+SRV-002 server- 10.0.0.2 Windows Online
+002.local Server 2022
+SRV-003 server- 10.0.0.3 RHEL 9 Online
+003.local
+SRV-004 server- 10.0.0.4 Debian 12 Maintenance
+004.local
+SRV-005 server- 10.0.0.5 CentOS Stream Online
+005.local 9
+SRV-006 server- 10.0.0.6 Ubuntu 22.04 Online
+006.local
+SRV-007 server- 10.0.0.7 Windows Offline
+007.local Server 2022
+SRV-008 server- 10.0.0.8 RHEL 9 Online
+008.local
+SRV-009 server- 10.0.0.9 Debian 12 Online
+009.local
+SRV-010 server- 10.0.0.10 CentOS Stream Online
+010.local 9
+SRV-011 server- 10.0.0.11 Ubuntu 22.04 Online
+011.local
+SRV-012 server- 10.0.0.12 Windows Online
+012.local Server 2022
+SRV-013 server- 10.0.0.13 RHEL 9 Online
+013.local
+SRV-014 server- 10.0.0.14 Debian 12 Maintenance
+014.local
+SRV-015 server- 10.0.0.15 CentOS Stream Online
+015.local 9
+SRV-016 server- 10.0.0.16 Ubuntu 22.04 Online
+016.local
+SRV-017 server- 10.0.0.17 Windows Offline
+017.local Server 2022
+SRV-018 server- 10.0.0.18 RHEL 9 Online
+018.local
+SRV-019 server- 10.0.
... (778 more characters)
docx_classic50_portrait_tall_image
- Text Similarity: 1.0
- Visual Average: 0.9808
- Overall Score: 0.9923
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1119479 bytes, Reference=156207 bytes
Text Diff
--- minipdf/docx_classic50_portrait_tall_image.pdf
+++ reference/docx_classic50_portrait_tall_image.pdf
@@ -3,8 +3,8 @@
Chief Technology Officer, NovaTech Inc.
Biography
Dr. Zhang is a recognized leader in distributed systems with over 15 years of experience in
-building large-scale cloud platforms. She holds a Ph.D. in Computer Science from MIT and has
-published 30+ papers on scalable architectures.
+building large-scale cloud platforms. She holds a Ph.D. in Computer Science from MIT and
+has published 30+ papers on scalable architectures.
Speaking Topics
• Microservices at Scale
• AI-Driven Infrastructure
docx_classic51_step_by_step_with_images
- Text Similarity: 1.0
- Visual Average: 0.9758
- Overall Score: 0.9903
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=892797 bytes, Reference=86405 bytes
Text content: ✅ Identical
docx_classic51_underline_styles
- Text Similarity: 1.0
- Visual Average: 0.9947
- Overall Score: 0.9979
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=877961 bytes, Reference=68896 bytes
Text content: ✅ Identical
docx_classic52_before_after_comparison
- Text Similarity: 1.0
- Visual Average: 0.9726
- Overall Score: 0.989
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883662 bytes, Reference=71180 bytes
Text content: ✅ Identical
docx_classic52_spacing_before_after
- Text Similarity: 1.0
- Visual Average: 0.9751
- Overall Score: 0.99
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883152 bytes, Reference=76163 bytes
Text content: ✅ Identical
docx_classic53_color_swatch_palette
- Text Similarity: 1.0
- Visual Average: 0.9723
- Overall Score: 0.9889
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885107 bytes, Reference=105872 bytes
Text content: ✅ Identical
docx_classic53_table_merged_complex
- Text Similarity: 1.0
- Visual Average: 0.9792
- Overall Score: 0.9917
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=891074 bytes, Reference=93385 bytes
Text content: ✅ Identical
docx_classic54_multi_font_family
- Text Similarity: 1.0
- Visual Average: 0.9859
- Overall Score: 0.9944
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1131700 bytes, Reference=250187 bytes
Text content: ✅ Identical
docx_classic54_travel_destination_cards
- Text Similarity: 1.0
- Visual Average: 0.9799
- Overall Score: 0.992
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=892304 bytes, Reference=105061 bytes
Text content: ✅ Identical
docx_classic55_background_shading_paragraph
- Text Similarity: 1.0
- Visual Average: 0.9366
- Overall Score: 0.9746
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=878677 bytes, Reference=67663 bytes
Text content: ✅ Identical
docx_classic55_lab_results_with_image
- Text Similarity: 1.0
- Visual Average: 0.9545
- Overall Score: 0.9818
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=902666 bytes, Reference=118015 bytes
Text content: ✅ Identical
docx_classic56_images_and_tables_mixed
- Text Similarity: 1.0
- Visual Average: 0.952
- Overall Score: 0.9808
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=894450 bytes, Reference=84307 bytes
Text content: ✅ Identical
docx_classic56_software_screenshot_features
- Text Similarity: 1.0
- Visual Average: 0.9692
- Overall Score: 0.9877
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897803 bytes, Reference=118476 bytes
Text content: ✅ Identical
docx_classic57_right_to_left_text
- Text Similarity: 0.9328
- Visual Average: 0.9897
- Overall Score: 0.969
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1061155 bytes, Reference=97228 bytes
Text Diff
--- minipdf/docx_classic57_right_to_left_text.pdf
+++ reference/docx_classic57_right_to_left_text.pdf
@@ -4,6 +4,6 @@
Right-aligned text (simulating RTL):
This text is right-aligned to simulate right-to-left layout.
Hebrew sample (RTL):
-םולש םלוע
+עולם שלום
Arabic sample (RTL):
-ﺎﺒﺣﺮﻣ ﻢﻟﺎﻌﻟﺎﺑ
+بالعالم مرحبا
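In the Hebrew sample above, the string extracted from the MiniPdf output is the exact character-by-character reverse of the string extracted from the reference. The Arabic line from MiniPdf also appears to use presentation-form code points, which Unicode NFKC normalization maps back to the base letters. The sketch below illustrates both observations with Python's standard library. Code points are written as escapes to avoid bidirectional rendering confusion in editors. One plausible reading is a difference between visual and logical character order in the text layer, but that is an assumption, not a description of MiniPdf's internals.

```python
import unicodedata

# Hebrew line as extracted from the MiniPdf output (copied from the diff).
minipdf_hebrew = "\u05dd\u05d5\u05dc\u05e9 \u05dd\u05dc\u05d5\u05e2"
# The same line as extracted from the reference PDF.
reference_hebrew = "\u05e2\u05d5\u05dc\u05dd \u05e9\u05dc\u05d5\u05dd"

# Reversing the MiniPdf string code point by code point yields the reference.
is_exact_reversal = minipdf_hebrew[::-1] == reference_hebrew

# U+FE8E is ARABIC LETTER ALEF FINAL FORM, a presentation form.
# NFKC normalization maps it to the base letter U+0627 ARABIC LETTER ALEF.
normalized_alef = unicodedata.normalize("NFKC", "\ufe8e")

print(is_exact_reversal, normalized_alef == "\u0627")
```

A text comparison that reversed right-to-left runs and applied NFKC before scoring would likely rate this case higher. Whether that is appropriate depends on whether the benchmark is meant to measure extraction order.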
docx_classic57_sports_results_with_logos
- Text Similarity: 1.0
- Visual Average: 0.9531
- Overall Score: 0.9812
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=901510 bytes, Reference=112026 bytes
Text content: ✅ Identical
docx_classic58_dense_paragraph_document
- Text Similarity: 1.0
- Visual Average: 0.7725
- Overall Score: 0.809
- Pages: MiniPdf=6, Reference=7
- File Size: MiniPdf=990790 bytes, Reference=107116 bytes
Text Diff
--- minipdf/docx_classic58_dense_paragraph_document.pdf
+++ reference/docx_classic58_dense_paragraph_document.pdf
@@ -1,245 +1,267 @@
Research Paper: Modern Software Engineering
Author: Dr. Jane Smith | Published: March 2026
Abstract
-Modern software engineering practices emphasize continuous integration, automated testing, and
-agile methodologies. The rapid evolution of cloud computing and containerization has transformed
-how teams build and deploy applications. Microservices architecture enables independent scaling
-and deployment of components. Modern software engineering practices emphasize continuous
-integration, automated testing, and agile methodologies. The rapid evolution of cloud computing
-and containerization has transformed how teams build and deploy applications. Microservices
-architecture enables independent scaling and deployment of components. Modern software
-engineering practices emphasize continuous integration, automated testing, and agile
-methodologies. The rapid evolution of cloud computing and containerization has transformed how
-teams build and deploy applications. Microservices architecture enables independent scaling and
-deployment of components.
-Modern software engineering practices emphasize continuous integration, automated testing, and
-agile methodologies. The rapid evolution of cloud computing and containerization has transformed
-how teams build and deploy applications. Microservices architecture enables independent scaling
-and deployment of components. Modern software engineering practices emphasize continuous
-integration, automated testing, and agile methodologies. The rapid evolution of cloud computing
-and containerization has transformed how teams build and deploy applications. Microservices
-architecture enables independent scaling and deployment of components. Modern software
-engineering practices emphasize continuous integration, automated testing, and agile
-methodologies. The rapid evolution of cloud computing and containerization has transformed how
-teams build and deploy applications. Microservices architecture enables independent scaling and
+Modern software engineering practices emphasize continuous integration, automated
+testing, and agile methodologies. The rapid evolution of cloud computing and
+containerization has transformed how teams build and deploy applications. Microservices
+architecture enables independent scaling and deployment of components. Modern software
+engineering practices emphasize continuous integration, automated testing, and agile
+methodologies. The rapid evolution of cloud computing and containerization has
+transformed how teams build and deploy applications. Microservices architecture enables
+independent scaling and deployment of components. Modern software engineering
+practices emphasize continuous integration, automated testing, and agile methodologies.
+The rapid evolution of cloud computing and containerization has t
... (38350 more characters)
docx_classic58_report_with_footer_image
- Text Similarity: 1.0
- Visual Average: 0.9544
- Overall Score: 0.9818
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1129533 bytes, Reference=136402 bytes
Text content: ✅ Identical
docx_classic59_numbered_and_bullet_mixed
- Text Similarity: 0.9948
- Visual Average: 0.9816
- Overall Score: 0.9906
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=893244 bytes, Reference=119386 bytes
Text Diff
--- minipdf/docx_classic59_numbered_and_bullet_mixed.pdf
+++ reference/docx_classic59_numbered_and_bullet_mixed.pdf
@@ -15,6 +15,6 @@
• Configure font embedding preferences
• Enable or disable image compression
Troubleshooting
-1. Verify .NET SDK installation with dotnet --version
-2. Clear NuGet cache if packages fail to restore
-3. Check file permissions on output directory
+6. Verify .NET SDK installation with dotnet --version
+7. Clear NuGet cache if packages fail to restore
+8. Check file permissions on output directory
docx_classic59_nutrition_label_with_image
- Text Similarity: 1.0
- Visual Average: 0.9718
- Overall Score: 0.9887
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=891874 bytes, Reference=109697 bytes
Text content: ✅ Identical
docx_classic60_comprehensive_styled_report
- Text Similarity: 1.0
- Visual Average: 0.9579
- Overall Score: 0.9832
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=1375911 bytes, Reference=244937 bytes
Text Diff
--- minipdf/docx_classic60_comprehensive_styled_report.pdf
+++ reference/docx_classic60_comprehensive_styled_report.pdf
@@ -4,8 +4,8 @@
March 2026
---PAGE---
Executive Summary
-Key Findings: The technology sector continues to show strong growth driven by AI adoption,
-cloud migration, and digital transformation initiatives.
+Key Findings: The technology sector continues to show strong growth driven by AI
+adoption, cloud migration, and digital transformation initiatives.
Market Overview
Sector Q4 2025 Q1 2026 Change
Cloud Computing \$180B \$195B +8.3%
@@ -24,11 +24,11 @@
• Edge computing gaining traction
Visual Summary
Growth indicator (placeholder):
----PAGE---
Recommendations
1. Accelerate AI integration strategy
2. Diversify cloud vendor dependencies
3. Invest in employee upskilling programs
+---PAGE---
4. Strengthen data governance frameworks
5. Evaluate emerging quantum computing capabilities
--- End of Report ---
docx_classic60_project_status_with_milestones
- Text Similarity: 1.0
- Visual Average: 0.9734
- Overall Score: 0.9894
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=903246 bytes, Reference=141002 bytes
Text content: ✅ Identical
docx_classic61_header_and_footer
- Text Similarity: 1.0
- Visual Average: 0.9662
- Overall Score: 0.9865
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883692 bytes, Reference=76328 bytes
Text Diff
--- minipdf/docx_classic61_header_and_footer.pdf
+++ reference/docx_classic61_header_and_footer.pdf
@@ -1,13 +1,13 @@
MiniPdf Benchmark Report
Header and Footer Test
-Section 1: This content appears between the header and footer. It demonstrates how MiniPdf
-handles page headers and footers in DOCX conversion.
-Section 2: This content appears between the header and footer. It demonstrates how MiniPdf
-handles page headers and footers in DOCX conversion.
-Section 3: This content appears between the header and footer. It demonstrates how MiniPdf
-handles page headers and footers in DOCX conversion.
-Section 4: This content appears between the header and footer. It demonstrates how MiniPdf
-handles page headers and footers in DOCX conversion.
-Section 5: This content appears between the header and footer. It demonstrates how MiniPdf
-handles page headers and footers in DOCX conversion.
+Section 1: This content appears between the header and footer. It demonstrates how
+MiniPdf handles page headers and footers in DOCX conversion.
+Section 2: This content appears between the header and footer. It demonstrates how
+MiniPdf handles page headers and footers in DOCX conversion.
+Section 3: This content appears between the header and footer. It demonstrates how
+MiniPdf handles page headers and footers in DOCX conversion.
+Section 4: This content appears between the header and footer. It demonstrates how
+MiniPdf handles page headers and footers in DOCX conversion.
+Section 5: This content appears between the header and footer. It demonstrates how
+MiniPdf handles page headers and footers in DOCX conversion.
Page 1 | Confidential
docx_classic62_footnote_references
- Text Similarity: 1.0
- Visual Average: 0.9763
- Overall Score: 0.9905
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885282 bytes, Reference=83417 bytes
Text content: ✅ Identical
docx_classic63_toc_style_headings
- Text Similarity: 1.0
- Visual Average: 0.9804
- Overall Score: 0.9922
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=575987 bytes, Reference=116889 bytes
Text Diff
--- minipdf/docx_classic63_toc_style_headings.pdf
+++ reference/docx_classic63_toc_style_headings.pdf
@@ -1,17 +1,22 @@
Table of Contents
Chapter 1: Introduction 1
-1.1 Background 3
-1.2 Objectives 5
+1.1 Background
+3
+1.2 Objectives
+5
Chapter 2: Literature Review 7
2.1 Historical Context 8
-2.2 Current Research 12
+2.2 Current Research
+12
2.2.1 Methodology
13
2.2.2 Findings
15
Chapter 3: Methodology 18
-3.1 Data Collection 19
-3.2 Analysis Framework 22
+3.1 Data Collection
+19
+3.2 Analysis Framework
+22
Chapter 4: Results 25
Chapter 5: Discussion 30
Chapter 6: Conclusion 35
docx_classic64_multi_column_layout
- Text Similarity: 0.8494
- Visual Average: 0.9755
- Overall Score: 0.93
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=891405 bytes, Reference=84036 bytes
Text Diff
--- minipdf/docx_classic64_multi_column_layout.pdf
+++ reference/docx_classic64_multi_column_layout.pdf
@@ -1,14 +1,15 @@
Newsletter - March 2026
-Artificial intelligence continues to reshape the Cloud computing has become the backbone of
-technology landscape. New advances in large modern enterprise IT. Multi-cloud strategies
-language models enable more natural human- allow organizations to leverage the best
-computer interaction. Companies worldwide features of different providers while avoiding
-are investing heavily in AI research and vendor lock-in. Edge computing supplements
-development, anticipating transformative cloud by processing data closer to its source,
-impacts across industries from healthcare to reducing latency for critical applications.
-manufacturing.
+Artificial intelligence continues to reshape Cloud computing has become the backbone
+the technology landscape. New advances in of modern enterprise IT. Multi-cloud
+large language models enable more natural strategies allow organizations to leverage
+human-computer interaction. Companies the best features of different providers
+worldwide are investing heavily in AI while avoiding vendor lock-in. Edge
+research and development, anticipating computing supplements cloud by
+transformative impacts across industries processing data closer to its source,
+from healthcare to manufacturing. reducing latency for critical applications.
Featured Article
-Open source software has become the foundation of modern software development. Projects like
-Linux, Kubernetes, and .NET have demonstrated how community-driven development can
-produce enterprise-grade software. The MiniPdf project itself is an example of this approach,
-providing PDF generation capabilities without proprietary dependencies.
+Open source software has become the foundation of modern software development.
+Projects like Linux, Kubernetes, and .NET have demonstrated how community-driven
+development can produce enterprise-grade software. The MiniPdf project itself is an
+example of this approach, providing PDF generation capabilities without proprietary
+dependencies.
docx_classic65_code_block_styling
- Text Similarity: 1.0
- Visual Average: 0.8727
- Overall Score: 0.9491
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=707101 bytes, Reference=91759 bytes
Text content: ✅ Identical
docx_classic66_colored_title_page
- Text Similarity: 1.0
- Visual Average: 0.9709
- Overall Score: 0.9884
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=490911 bytes, Reference=89358 bytes
Text content: ✅ Identical
docx_classic67_alternating_row_table
- Text Similarity: 0.9778
- Visual Average: 0.9277
- Overall Score: 0.9622
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=911827 bytes, Reference=115887 bytes
Text Diff
--- minipdf/docx_classic67_alternating_row_table.pdf
+++ reference/docx_classic67_alternating_row_table.pdf
@@ -4,10 +4,12 @@
Developer
E002 Bob Williams Marketing Marketing \$95,000
Manager
-E003 Carol Davis Finance Financial Analyst \$88,000
+E003 Carol Davis Finance Financial \$88,000
+Analyst
E004 David Brown Engineering Tech Lead \$135,000
E005 Emily Chen Design UX Designer \$92,000
-E006 Frank Miller Engineering Junior Developer \$75,000
+E006 Frank Miller Engineering Junior \$75,000
+Developer
E007 Grace Lee HR HR Specialist \$82,000
E008 Henry Wilson Engineering DevOps \$110,000
Engineer
docx_classic68_sidebar_layout
- Text Similarity: 0.9091
- Visual Average: 0.9578
- Overall Score: 0.9468
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=884237 bytes, Reference=129996 bytes
Text Diff
--- minipdf/docx_classic68_sidebar_layout.pdf
+++ reference/docx_classic68_sidebar_layout.pdf
@@ -2,17 +2,17 @@
Navigation
Overview
Overview
-MiniPdf is a lightweight .NET library for
+MiniPdf is a lightweight .NET library for converting Word and
Installation
-converting Word and Excel documents to PDF
+Excel documents to PDF format without requiring Microsoft
Configuration
-format without requiring Microsoft Office.
+Office.
API Reference
Key Features
FAQ
-DOCX to PDF conversion
+• DOCX to PDF conversion
Changelog
-XLSX to PDF conversion
-CJK font support
-Image embedding
-Table formatting
+• XLSX to PDF conversion
+• CJK font support
+• Image embedding
+• Table formatting
docx_classic69_blockquote_styling
- Text Similarity: 1.0
- Visual Average: 0.9769
- Overall Score: 0.9908
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1111975 bytes, Reference=110121 bytes
Text content: ✅ Identical
docx_classic70_academic_paper
- Text Similarity: 0.9924
- Visual Average: 0.9463
- Overall Score: 0.9755
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1142431 bytes, Reference=158286 bytes
Text Diff
--- minipdf/docx_classic70_academic_paper.pdf
+++ reference/docx_classic70_academic_paper.pdf
@@ -3,17 +3,17 @@
Department of Computer Science, University of Technology
Abstract
This paper surveys modern techniques for generating PDF documents from
-structured office formats. We evaluate the quality and performance of conversion
-tools across a benchmark suite of 90 test documents covering diverse formatting
-features.
+structured office formats. We evaluate the quality and performance of
+conversion tools across a benchmark suite of 90 test documents covering
+diverse formatting features.
1. Introduction
-PDF (Portable Document Format) remains the standard for sharing documents with consistent
-visual fidelity. Converting from editable office formats such as DOCX and XLSX to PDF requires
-careful handling of fonts, layouts, images, and styling attributes.
+PDF (Portable Document Format) remains the standard for sharing documents with
+consistent visual fidelity. Converting from editable office formats such as DOCX and XLSX to
+PDF requires careful handling of fonts, layouts, images, and styling attributes.
2. Methodology
-Our benchmark suite consists of 90 DOCX test files and 90 XLSX test files, each targeting specific
-formatting features. We compare output from MiniPdf against LibreOffice-generated reference
-PDFs using pixel-level similarity scoring.
+Our benchmark suite consists of 90 DOCX test files and 90 XLSX test files, each targeting
+specific formatting features. We compare output from MiniPdf against LibreOffice-
+generated reference PDFs using pixel-level similarity scoring.
3. Results
Feature Category Avg Score Sample Size
Basic Text 98.5% 20
@@ -21,8 +21,8 @@
Images 94.8% 15
Mixed Content 95.1% 30
4. Conclusion
-Modern lightweight PDF generation libraries can achieve high fidelity output for the majority of
-common document formatting features.
+Modern lightweight PDF generation libraries can achieve high fidelity output for the
+majority of common document formatting features.
References
[1] ISO 32000-2:2020. Document management - Portable document format.
[2] ECMA-376. Office Open XML File Formats.
docx_classic71_legal_document
- Text Similarity: 1.0
- Visual Average: 0.9722
- Overall Score: 0.9889
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=271217 bytes, Reference=91798 bytes
Text Diff
--- minipdf/docx_classic71_legal_document.pdf
+++ reference/docx_classic71_legal_document.pdf
@@ -1,6 +1,6 @@
SOFTWARE LICENSE AGREEMENT
-This Software License Agreement (the "Agreement") is entered into as of March 1, 2026, by and
-between MiniPdf Corporation ("Licensor") and the End User ("Licensee").
+This Software License Agreement (the "Agreement") is entered into as of March 1, 2026, by
+and between MiniPdf Corporation ("Licensor") and the End User ("Licensee").
1. GRANT OF LICENSE
1.1 Subject to the terms of this Agreement, Licensor grants Licensee a non-exclusive,
non-transferable license to use the Software.
docx_classic72_technical_specification
- Text Similarity: 1.0
- Visual Average: 0.9368
- Overall Score: 0.9747
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=914279 bytes, Reference=158590 bytes
Text content: ✅ Identical
docx_classic73_calendar_layout
- Text Similarity: 1.0
- Visual Average: 0.9512
- Overall Score: 0.9805
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=274943 bytes, Reference=68915 bytes
Text content: ✅ Identical
docx_classic74_org_chart
- Text Similarity: 1.0
- Visual Average: 0.9177
- Overall Score: 0.9671
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885291 bytes, Reference=102171 bytes
Text content: ✅ Identical
docx_classic75_newsletter_layout
- Text Similarity: 0.9857
- Visual Average: 0.9726
- Overall Score: 0.9833
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=893768 bytes, Reference=112463 bytes
Text Diff
--- minipdf/docx_classic75_newsletter_layout.pdf
+++ reference/docx_classic75_newsletter_layout.pdf
@@ -1,10 +1,11 @@
THE TECH WEEKLY
Issue #42 | March 6, 2026
AI Revolution in Software Development
-The integration of large language models into development workflows is transforming how teams
-write, review, and deploy code. Industry experts predict that AI-assisted development will become
-standard practice by 2027.
+The integration of large language models into development workflows is transforming how
+teams write, review, and deploy code. Industry experts predict that AI-assisted
+development will become standard practice by 2027.
Quick Updates
Cloud Infrastructure Open Source
-Major cloud providers announce new edge The .NET Foundation releases new guidelines
-computing regions in Asia-Pacific. for community project governance.
+Major cloud providers announce new edge The .NET Foundation releases new
+computing regions in Asia-Pacific. guidelines for community project
+governance.
docx_classic76_recipe_card
- Text Similarity: 1.0
- Visual Average: 0.9654
- Overall Score: 0.9862
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=889298 bytes, Reference=138002 bytes
Text content: ✅ Identical
docx_classic77_timeline_layout
- Text Similarity: 1.0
- Visual Average: 0.929
- Overall Score: 0.9716
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897577 bytes, Reference=120673 bytes
Text Diff
--- minipdf/docx_classic77_timeline_layout.pdf
+++ reference/docx_classic77_timeline_layout.pdf
@@ -1,8 +1,10 @@
Project Timeline: MiniPdf v2.0
-Q1 2025 Project Inception Initial requirements gathering
-and architecture design.
+Q1 2025 Project Inception Initial requirements
+gathering and architecture
+design.
Q2 2025 XLSX Support Implemented Excel-to-PDF
-conversion with chart support.
+conversion with chart
+support.
Q3 2025 Benchmark Suite Created automated
benchmark pipeline with 60
test cases.
docx_classic78_faq_document
- Text Similarity: 1.0
- Visual Average: 0.9527
- Overall Score: 0.9811
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=899038 bytes, Reference=128489 bytes
Text Diff
--- minipdf/docx_classic78_faq_document.pdf
+++ reference/docx_classic78_faq_document.pdf
@@ -1,8 +1,9 @@
Frequently Asked Questions
Find answers to the most common questions about MiniPdf.
Q1: What is MiniPdf?
-A: MiniPdf is a lightweight .NET library for converting DOCX and XLSX files to PDF without
-requiring Microsoft Office installation. It supports both DOCX and XLSX formats.
+A: MiniPdf is a lightweight .NET library for converting DOCX and XLSX files to PDF
+without requiring Microsoft Office installation. It supports both DOCX and XLSX
+formats.
Q2: Which .NET versions are supported?
A: .NET 8.0 and later versions are supported. The library targets .NET Standard 2.0 for
maximum compatibility across different .NET implementations.
@@ -10,30 +11,30 @@
A: Yes, MiniPdf supports embedded images in both DOCX and XLSX formats. Images are
converted and embedded in the output PDF with proper scaling.
Q4: How is the conversion quality measured?
-A: Quality is measured by pixel-level comparison against LibreOffice-generated reference
-PDFs. Each test case receives a similarity score from 0% to 100%.
+A: Quality is measured by pixel-level comparison against LibreOffice-generated
+reference PDFs. Each test case receives a similarity score from 0% to 100%.
Q5: Can MiniPdf handle CJK characters?
A: Yes, MiniPdf includes CJK font embedding support for Chinese, Japanese, and Korean
text in both DOCX and XLSX documents.
Q6: Is MiniPdf available on NuGet?
A: Yes, MiniPdf is published as a NuGet package and can be installed via dotnet add
package MiniPdf.
+---PAGE---
Q7: What table features are supported?
----PAGE---
-A: MiniPdf supports table borders, cell shading, merged cells, column widths, and alternating
-row colors in both DOCX and XLSX formats.
+A: MiniPdf supports table borders, cell shading, merged cells, column widths, and
+alternating row colors in both DOCX and XLSX formats.
Q8: How do I report a bug?
-A: Please open an issue on the GitHub repository with a minimal reproduction case and the
-expected vs actual output.
+A: Please open an issue on the GitHub repository with a minimal reproduction case and
+the expected vs actual output.
Q9: Does MiniPdf support headers and footers?
-A: MiniPdf does not currently render headers and footers from DOCX files. The content area
-of each page is fully supported including text, tables, and images.
+A: MiniPdf does not currently render headers and footers from DOCX files. The content
+area of each page is fully supported including text, tables, and images.
Q10: What is the maximum file size supported?
A: There is no hard limit on file size. MiniPdf processes files in a streaming fashion and
memory usage scales with document complexity rather than file size.
Q11: Can I use MiniPdf in a web application?
-A: Yes, MiniPdf works in any .NET environment including ASP.NET Core web applications,
-Azure Functions, and containerized services.
+A: Y
... (318 more characters)
docx_classic79_glossary
- Text Similarity: 1.0
- Visual Average: 0.969
- Overall Score: 0.9876
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=889249 bytes, Reference=112625 bytes
Text Diff
--- minipdf/docx_classic79_glossary.pdf
+++ reference/docx_classic79_glossary.pdf
@@ -1,10 +1,10 @@
Glossary of Terms
API
-Application Programming Interface. A set of protocols and tools for building software
-applications.
+Application Programming Interface. A set of protocols and tools for building
+software applications.
CI/CD
-Continuous Integration / Continuous Deployment. Practices for automating software
-delivery.
+Continuous Integration / Continuous Deployment. Practices for automating
+software delivery.
CJK
Chinese, Japanese, Korean. Refers to the character sets used in these languages.
DOCX
docx_classic80_matrix_grid
- Text Similarity: 0.9737
- Visual Average: 0.9595
- Overall Score: 0.9733
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=910469 bytes, Reference=111032 bytes
Text Diff
--- minipdf/docx_classic80_matrix_grid.pdf
+++ reference/docx_classic80_matrix_grid.pdf
@@ -10,9 +10,10 @@
implementation
PDF writer A R I I
implementation
-Benchmark suite C R A I
-creation
-CJK font support A R C I
+Benchmark C R A I
+suite creation
+CJK font A R C I
+support
Code review C R A I
Documentation A R C C
Release R I C A
docx_classic81_budget_table
- Text Similarity: 1.0
- Visual Average: 0.9228
- Overall Score: 0.9691
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=912472 bytes, Reference=129792 bytes
Text content: ✅ Identical
docx_classic82_survey_questionnaire
- Text Similarity: 0.9883
- Visual Average: 0.9575
- Overall Score: 0.9783
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=928932 bytes, Reference=123279 bytes
Text Diff
--- minipdf/docx_classic82_survey_questionnaire.pdf
+++ reference/docx_classic82_survey_questionnaire.pdf
@@ -3,7 +3,8 @@
Work Environment
Statement 1 2 3 4 5
My
-workspace is
+workspace
+is
comfortable
and well-
equipped.
@@ -18,22 +19,25 @@
Management
Statement 1 2 3 4 5
My manager
-provides clear
-direction and
+provides
+clear
+direction
+and
expectations.
I receive
regular and
constructive
feedback.
Management
-is transparent
+is
+transparent
about
company
goals.
Career Development
Statement 1 2 3 4 5
+I have
---PAGE---
-I have
opportunities
for
professional
@@ -48,10 +52,10 @@
advancement.
Work-Life Balance
Statement 1 2 3 4 5
-I can maintain
-a healthy
-work-life
-balance.
+I can
+maintain a
+healthy work-
+life balance.
Flexible work
arrangements
are available.
docx_classic83_medical_form
- Text Similarity: 1.0
- Visual Average: 0.9441
- Overall Score: 0.9776
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=908905 bytes, Reference=116392 bytes
Text content: ✅ Identical
docx_classic84_shipping_label
- Text Similarity: 1.0
- Visual Average: 0.9205
- Overall Score: 0.9682
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=342342 bytes, Reference=115136 bytes
Text content: ✅ Identical
docx_classic85_report_card
- Text Similarity: 1.0
- Visual Average: 0.9347
- Overall Score: 0.9739
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=285416 bytes, Reference=95268 bytes
Text content: ✅ Identical
docx_classic86_checklist_document
- Text Similarity: 1.0
- Visual Average: 0.9714
- Overall Score: 0.9886
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1036078 bytes, Reference=89619 bytes
Text content: ✅ Identical
docx_classic87_bibliography
- Text Similarity: 1.0
- Visual Average: 0.9746
- Overall Score: 0.9898
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1125271 bytes, Reference=141475 bytes
Text Diff
--- minipdf/docx_classic87_bibliography.pdf
+++ reference/docx_classic87_bibliography.pdf
@@ -3,12 +3,12 @@
Journal of Software Engineering, 42(3), 125-148.
[2] Chen, L., Wang, M., & Liu, X. (2024). Benchmarking PDF Generation Libraries: Quality and
Performance. ACM Computing Surveys, 57(1), 1-35.
-[3] Brown, A. (2025). Office Open XML: Architecture and Implementation. IEEE Transactions on
-Document Analysis, 28(4), 890-905.
+[3] Brown, A. (2025). Office Open XML: Architecture and Implementation. IEEE Transactions
+on Document Analysis, 28(4), 890-905.
[4] ISO (2020). ISO 32000-2:2020 Document Management - Portable Document Format.
International Organization for Standardization.
-[5] ECMA International (2016). ECMA-376: Office Open XML File Formats. ECMA International,
-5th Edition.
+[5] ECMA International (2016). ECMA-376: Office Open XML File Formats. ECMA
+International, 5th Edition.
[6] Davis, K., & Martinez, S. (2024). AI-Driven Code Review: Patterns and Anti-Patterns.
Proceedings of ICSE 2024, pp. 445-460.
[7] Taylor, P. (2025). CJK Font Embedding in Portable Documents. International Journal of
docx_classic88_presentation_handout
- Text Similarity: 1.0
- Visual Average: 0.6971
- Overall Score: 0.8788
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=898950 bytes, Reference=112210 bytes
Text Diff
--- minipdf/docx_classic88_presentation_handout.pdf
+++ reference/docx_classic88_presentation_handout.pdf
@@ -20,11 +20,11 @@
______________________________________________________________________
Slide 4: XLSX Support
Excel workbook processing and chart rendering.
----PAGE---
Notes:
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
+---PAGE---
Slide 5: Quality Assurance
Benchmark pipeline and self-evolution cycle.
Notes:
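Note on the page-break drift: both PDFs have two pages and identical text, yet the Visual Average is low because the break falls at a different line. A quick way to see this from the extracted text is to count lines per page around the `---PAGE---` marker. This is a minimal sketch; the helper `lines_per_page` is hypothetical and not part of the benchmark tooling.

```python
PAGE_MARK = "---PAGE---"


def lines_per_page(text_lines):
    """Return the number of text lines on each page, splitting on PAGE_MARK."""
    counts = []
    current = 0
    for line in text_lines:
        if line.strip() == PAGE_MARK:
            counts.append(current)
            current = 0
        else:
            current += 1
    counts.append(current)  # lines after the last marker
    return counts


rule = "_" * 70
# Excerpt reconstructed from the diff above (MiniPdf breaks before "Notes:").
minipdf = [
    "Slide 4: XLSX Support",
    "Excel workbook processing and chart rendering.",
    PAGE_MARK,
    "Notes:", rule, rule, rule,
    "Slide 5: Quality Assurance",
]
# The reference breaks after the three note lines instead.
reference = [
    "Slide 4: XLSX Support",
    "Excel workbook processing and chart rendering.",
    "Notes:", rule, rule, rule,
    PAGE_MARK,
    "Slide 5: Quality Assurance",
]

print(lines_per_page(minipdf))    # [2, 5]
print(lines_per_page(reference))  # [6, 1]
```

In this excerpt the MiniPdf break comes four lines earlier than the reference break, so everything after it is vertically shifted on the second page.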
docx_classic89_multi_image_gallery
- Text Similarity: 1.0
- Visual Average: 0.9495
- Overall Score: 0.9798
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1116224 bytes, Reference=103275 bytes
Text Diff
--- minipdf/docx_classic89_multi_image_gallery.pdf
+++ reference/docx_classic89_multi_image_gallery.pdf
@@ -3,5 +3,5 @@
Steel Blue Crimson Lime Green
Orange Purple Turquoise
Gold Rosy Brown Cornflower
-Each image demonstrates a different color channel for testing image encoding fidelity in PDF
-output.
+Each image demonstrates a different color channel for testing image encoding fidelity in
+PDF output.
docx_classic90_comprehensive_annual_report
- Text Similarity: 1.0
- Visual Average: 0.9606
- Overall Score: 0.9842
- Pages: MiniPdf=4, Reference=4
- File Size: MiniPdf=1160402 bytes, Reference=198798 bytes
Text Diff
--- minipdf/docx_classic90_comprehensive_annual_report.pdf
+++ reference/docx_classic90_comprehensive_annual_report.pdf
@@ -11,9 +11,9 @@
Outlook & Strategy................................10
---PAGE---
Executive Summary
-Dear Stakeholders, We are pleased to present the annual report for MiniPdf Corporation. This
-year marked significant milestones in our product development, including the launch of
-DOCX-to-PDF conversion and expansion of our benchmark suite to 180 test cases.
+Dear Stakeholders, We are pleased to present the annual report for MiniPdf Corporation.
+This year marked significant milestones in our product development, including the launch
+of DOCX-to-PDF conversion and expansion of our benchmark suite to 180 test cases.
Key Metrics
Revenue Users Test Cases Quality Score
\$2.4M 12,500 180 97.2%
docx_classic91_landscape_page
- Text Similarity: 1.0
- Visual Average: 0.9776
- Overall Score: 0.991
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897350 bytes, Reference=109003 bytes
Text content: ✅ Identical
docx_classic92_first_line_indent
- Text Similarity: 0.9868
- Visual Average: 0.9635
- Overall Score: 0.9801
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885958 bytes, Reference=77578 bytes
Text Diff
--- minipdf/docx_classic92_first_line_indent.pdf
+++ reference/docx_classic92_first_line_indent.pdf
@@ -1,11 +1,12 @@
First-Line Indentation Example
-The quick brown fox jumps over the lazy dog. This paragraph demonstrates first-line
-indentation, a common typographic convention in printed books and formal documents to indicate
-the start of a new paragraph.
-Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt
-ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco
-laboris.
-In traditional typography, the first paragraph after a heading is often not indented, while
-subsequent paragraphs use a first-line indent of about 0.5 inches to visually separate them.
+The quick brown fox jumps over the lazy dog. This paragraph demonstrates first-
+line indentation, a common typographic convention in printed books and formal documents
+to indicate the start of a new paragraph.
+Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor
+incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud
+exercitation ullamco laboris.
+In traditional typography, the first paragraph after a heading is often not indented,
+while subsequent paragraphs use a first-line indent of about 0.5 inches to visually separate
+them.
This style of formatting creates a clean, professional appearance and helps readers
distinguish between paragraph boundaries without relying on extra spacing.
docx_classic93_hanging_indent
- Text Similarity: 1.0
- Visual Average: 0.9574
- Overall Score: 0.983
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=892256 bytes, Reference=93155 bytes
Text Diff
--- minipdf/docx_classic93_hanging_indent.pdf
+++ reference/docx_classic93_hanging_indent.pdf
@@ -5,10 +5,11 @@
Applications." Journal of Document Processing, 15(3), 234-251.
Williams, D. R. (2023). "Cross-Platform Document Conversion: Challenges and Solutions." In
Proceedings of the International Conference on Software Engineering, pp. 89-102.
-Chen, L., & Martinez, R. (2025). "Benchmarking Document Fidelity: A Comprehensive Framework
-for PDF Quality Assessment." ACM Computing Surveys, 58(1), Article 12.
-Anderson, K. M., Brown, T. P., & Davis, S. (2024). "Open-Source Approaches to Office Document
-Rendering Without Native Dependencies." Software: Practice and Experience, 54(7),
-1120-1145.
+Chen, L., & Martinez, R. (2025). "Benchmarking Document Fidelity: A Comprehensive
+Framework for PDF Quality Assessment." ACM Computing Surveys, 58(1), Article
+12.
+Anderson, K. M., Brown, T. P., & Davis, S. (2024). "Open-Source Approaches to Office
+Document Rendering Without Native Dependencies." Software: Practice and
+Experience, 54(7), 1120-1145.
Taylor, E. F. (2025). "CJK Font Embedding Strategies in Cross-Platform PDF Generators."
International Journal of Digital Typography, 8(2), 67-84.
docx_classic94_custom_bullet_characters
- Text Similarity: 1.0
- Visual Average: 0.9851
- Overall Score: 0.994
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1036888 bytes, Reference=117189 bytes
Text content: ✅ Identical
docx_classic95_contract_template
- Text Similarity: 1.0
- Visual Average: 0.9813
- Overall Score: 0.9925
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=268111 bytes, Reference=82650 bytes
Text content: ✅ Identical
docx_classic96_dense_data_table
- Text Similarity: 1.0
- Visual Average: 0.8705
- Overall Score: 0.9482
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=956370 bytes, Reference=126875 bytes
Text content: ✅ Identical
docx_classic97_product_catalog
- Text Similarity: 1.0
- Visual Average: 0.9367
- Overall Score: 0.9747
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=268145 bytes, Reference=84145 bytes
Text Diff
--- minipdf/docx_classic97_product_catalog.pdf
+++ reference/docx_classic97_product_catalog.pdf
@@ -11,6 +11,6 @@
rendering, and batch processing.
MiniPdf Enterprise
\$199/mo
-Complete enterprise solution with API access,
-custom branding, SLA guarantee, and
-dedicated support.
+Complete enterprise solution with API
+access, custom branding, SLA guarantee,
+and dedicated support.
docx_classic98_training_manual
- Text Similarity: 1.0
- Visual Average: 0.962
- Overall Score: 0.9848
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=975883 bytes, Reference=173553 bytes
Text Diff
--- minipdf/docx_classic98_training_manual.pdf
+++ reference/docx_classic98_training_manual.pdf
@@ -15,7 +15,8 @@
using MiniPdf;
var converter = new DocxToPdfConverter();
converter.Convert("input.docx", "output.pdf");
-WARNING: Large files may require additional memory. Set appropriate limits for production use.
+WARNING: Large files may require additional memory. Set appropriate limits for
+production use.
Chapter 3: Advanced Features
CJK Support: Enable Chinese, Japanese, and Korean font embedding for international
documents.
docx_classic99_policy_document
- Text Similarity: 1.0
- Visual Average: 0.9494
- Overall Score: 0.9798
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1130585 bytes, Reference=139982 bytes
Text Diff
--- minipdf/docx_classic99_policy_document.pdf
+++ reference/docx_classic99_policy_document.pdf
@@ -6,20 +6,20 @@
Reviewer A. Chen 2026-02-20
Approver D. Williams 2026-03-01
1. PURPOSE
-This policy establishes the information security requirements for all employees, contractors, and
-third-party users of MiniPdf Corporation systems and data.
+This policy establishes the information security requirements for all employees,
+contractors, and third-party users of MiniPdf Corporation systems and data.
2. SCOPE
This policy applies to all information assets, IT systems, and personnel who access, process,
store, or transmit company data regardless of format or location.
3. PASSWORD REQUIREMENTS
-All passwords must be at least 12 characters long and include uppercase letters, lowercase letters,
-numbers, and special characters. Passwords must be changed every 90 days.
+All passwords must be at least 12 characters long and include uppercase letters, lowercase
+letters, numbers, and special characters. Passwords must be changed every 90 days.
4. DATA CLASSIFICATION
-Data shall be classified as: Public, Internal, Confidential, or Restricted. Each classification level
-has specific handling requirements detailed in Appendix A.
+Data shall be classified as: Public, Internal, Confidential, or Restricted. Each classification
+level has specific handling requirements detailed in Appendix A.
5. INCIDENT REPORTING
-All security incidents must be reported within 24 hours to the Information Security Team. Failure to
-report incidents may result in disciplinary action.
+All security incidents must be reported within 24 hours to the Information Security Team.
+Failure to report incidents may result in disciplinary action.
6. COMPLIANCE
Violations of this policy may result in disciplinary action up to and including termination of
employment or contract.
docx_classic100_multi_page_table
- Text Similarity: 0.9932
- Visual Average: 0.8609
- Overall Score: 0.9416
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=992431 bytes, Reference=141154 bytes
Text Diff
--- minipdf/docx_classic100_multi_page_table.pdf
+++ reference/docx_classic100_multi_page_table.pdf
@@ -37,11 +37,12 @@
EMP-034 Grace Johnson HR Analyst Tokyo
EMP-035 Eva Johnson Marketing Senior Engineer San Francisco
EMP-036 Grace Jones Engineering Analyst London
----PAGE---
EMP-037 Alice Smith HR Analyst New York
EMP-038 Grace Smith Operations Manager New York
EMP-039 Eva Smith Engineering Coordinator London
EMP-040 Frank Davis Engineering Specialist New York
+---PAGE---
+ID Name Department Title Location
EMP-041 Bob Smith Operations Coordinator London
EMP-042 Iris Williams Marketing Specialist San Francisco
EMP-043 Alice Garcia HR Director New York
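Note on the repeated header: in the diff above, the reference output repeats the header row (`ID Name Department Title Location`) at the top of the second page, while the MiniPdf output does not. The sketch below illustrates the general technique of repeating a header row when a table is split across pages. It is only an illustration with a fixed row budget per page, not MiniPdf's or LibreOffice's actual layout logic, and `paginate_rows` is a hypothetical name.

```python
def paginate_rows(header, rows, rows_per_page):
    """Split table rows into pages, repeating the header on every page.

    rows_per_page counts the header row, so each page carries
    rows_per_page - 1 body rows. Real layout engines measure row
    heights instead of counting rows; this is a simplification.
    """
    body_capacity = rows_per_page - 1
    if body_capacity < 1:
        raise ValueError("rows_per_page must leave room for at least one body row")
    pages = []
    for start in range(0, len(rows), body_capacity):
        pages.append([header] + rows[start:start + body_capacity])
    return pages


header = "ID Name Department Title Location"
rows = [f"EMP-{n:03d}" for n in range(39, 44)]  # placeholder row labels
for page in paginate_rows(header, rows, rows_per_page=4):
    print(page)
```

Because the repeated header occupies one row on each continuation page, a renderer that repeats it and one that does not can place the same rows at different vertical positions on later pages.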
docx_classic101_warranty_document
- Text Similarity: 1.0
- Visual Average: 0.963
- Overall Score: 0.9852
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=892208 bytes, Reference=134098 bytes
Text Diff
--- minipdf/docx_classic101_warranty_document.pdf
+++ reference/docx_classic101_warranty_document.pdf
@@ -2,8 +2,8 @@
MiniPdf Software Products
WARRANTY COVERAGE
MiniPdf Corporation ("Company") warrants that the Software will perform substantially in
-accordance with the accompanying documentation for a period of one (1) year from the date of
-purchase ("Warranty Period").
+accordance with the accompanying documentation for a period of one (1) year from the
+date of purchase ("Warranty Period").
WHAT IS COVERED
• Defects in the conversion engine that cause incorrect output
• Font rendering issues for supported font families
@@ -15,7 +15,7 @@
• Third-party integration failures
• Cosmetic differences between source and output documents
LIMITATION OF LIABILITY
-IN NO EVENT SHALL THE COMPANY BE LIABLE FOR ANY INDIRECT,
-INCIDENTAL, SPECIAL, CONSEQUENTIAL, OR PUNITIVE DAMAGES ARISING
-OUT OF OR IN CONNECTION WITH THIS WARRANTY.
+IN NO EVENT SHALL THE COMPANY BE LIABLE FOR ANY INDIRECT, INCIDENTAL,
+SPECIAL, CONSEQUENTIAL, OR PUNITIVE DAMAGES ARISING OUT OF OR IN
+CONNECTION WITH THIS WARRANTY.
For warranty claims, contact: support@minipdf.example.com
docx_classic102_curriculum_syllabus
- Text Similarity: 0.9961
- Visual Average: 0.9448
- Overall Score: 0.9764
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=910412 bytes, Reference=150812 bytes
Text Diff
--- minipdf/docx_classic102_curriculum_syllabus.pdf
+++ reference/docx_classic102_curriculum_syllabus.pdf
@@ -3,9 +3,9 @@
Instructor
Dr. Jane Smith | jane.smith@university.edu | Office: CS Building 312
Course Description
-This course covers modern software engineering practices including agile methodologies, test-
-driven development, continuous integration, and software architecture patterns. Students will work
-in teams on a semester-long project.
+This course covers modern software engineering practices including agile methodologies,
+test-driven development, continuous integration, and software architecture patterns.
+Students will work in teams on a semester-long project.
Learning Objectives
1. Apply software design patterns to real-world problems
2. Implement CI/CD pipelines for automated testing
docx_classic103_event_program
- Text Similarity: 0.9917
- Visual Average: 0.9646
- Overall Score: 0.9825
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=1136876 bytes, Reference=147081 bytes
Text Diff
--- minipdf/docx_classic103_event_program.pdf
+++ reference/docx_classic103_event_program.pdf
@@ -5,7 +5,8 @@
Time Session Location
09:00 - 09:30 Registration & Coffee Lobby
09:30 - 10:30 Keynote: The Future of .NET Main Hall
-10:45 - 11:30 Building PDF Libraries in C# Room A
+10:45 - 11:30 Building PDF Libraries in Room A
+C#
11:45 - 12:30 AI-Powered Code Review Room B
12:30 - 14:00 Lunch Break Restaurant
14:00 - 14:45 Benchmark-Driven Room A
docx_classic104_sop_document
- Text Similarity: 0.9915
- Visual Average: 0.9759
- Overall Score: 0.987
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=904390 bytes, Reference=161569 bytes
Text Diff
--- minipdf/docx_classic104_sop_document.pdf
+++ reference/docx_classic104_sop_document.pdf
@@ -1,13 +1,14 @@
MiniPdf Corp SOP-QA-001 Rev: 3 Page 1 of 1
-Department: QA Effective: 2026-03-01 Author: J. Smith Approved: D. Lee
+Department: QA Effective: 2026-03- Author: J. Smith Approved: D. Lee
+01
STANDARD OPERATING PROCEDURE
Quality Assurance Testing Procedure
1. Purpose
-This SOP defines the standard process for running quality assurance benchmarks on the MiniPdf
-conversion engine.
+This SOP defines the standard process for running quality assurance benchmarks on the
+MiniPdf conversion engine.
2. Scope
-Applies to all QA engineers responsible for validating DOCX-to-PDF and XLSX-to-PDF conversion
-quality.
+Applies to all QA engineers responsible for validating DOCX-to-PDF and XLSX-to-PDF
+conversion quality.
3. Procedure
3.1 Prepare Test Files
Generate test documents using the benchmark generator scripts.
docx_classic105_certificate
- Text Similarity: 1.0
- Visual Average: 0.9717
- Overall Score: 0.9887
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=3976395 bytes, Reference=104606 bytes
Text content: ✅ Identical
docx_classic106_multi_section_orientation
- Text Similarity: 0.9933
- Visual Average: 0.9772
- Overall Score: 0.9882
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=913020 bytes, Reference=121828 bytes
Text Diff
--- minipdf/docx_classic106_multi_section_orientation.pdf
+++ reference/docx_classic106_multi_section_orientation.pdf
@@ -1,19 +1,20 @@
Section 1: Executive Summary (Portrait)
-This document demonstrates multiple sections with different page orientations. The first section
-uses portrait layout for narrative content.
-Key finding 1: The analysis shows significant improvement in document conversion quality metrics
-across all test categories.
-Key finding 2: The analysis shows significant improvement in document conversion quality metrics
-across all test categories.
-Key finding 3: The analysis shows significant improvement in document conversion quality metrics
-across all test categories.
+This document demonstrates multiple sections with different page orientations. The first
+section uses portrait layout for narrative content.
+Key finding 1: The analysis shows significant improvement in document conversion quality
+metrics across all test categories.
+Key finding 2: The analysis shows significant improvement in document conversion quality
+metrics across all test categories.
+Key finding 3: The analysis shows significant improvement in document conversion quality
+metrics across all test categories.
---PAGE---
Section 2: Data Analysis (Landscape)
Metric Jan Feb Mar Apr May Jun Avg
Quality Score 96% 99% 92% 85% 93% 93% 93%
Conversion 98% 92% 85% 97% 89% 97% 93%
Time
-File Size Ratio 86% 88% 97% 92% 90% 91% 90%
+File Size 86% 88% 97% 92% 90% 91% 90%
+Ratio
Error Rate 92% 96% 95% 89% 96% 88% 92%
---PAGE---
Section 3: Conclusions (Portrait)
docx_classic107_order_form
- Text Similarity: 0.9859
- Visual Average: 0.94
- Overall Score: 0.9704
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=283307 bytes, Reference=96453 bytes
Text Diff
--- minipdf/docx_classic107_order_form.pdf
+++ reference/docx_classic107_order_form.pdf
@@ -3,13 +3,15 @@
Vendor: MiniPdf Corp Ship To: Acme Inc
Payment: Net 30 Delivery: March 20, 2026
Item # Description Qty Unit Price Total
-001 MiniPdf Standard 5 \$29.00 \$145.00
+001 MiniPdf 5 \$29.00 \$145.00
+Standard
License
002 MiniPdf 2 \$199.00 \$398.00
Enterprise
License
003 Premium 1 \$500.00 \$500.00
-Support (Annual)
+Support
+(Annual)
004 Training 10 \$50.00 \$500.00
Workshop (Per
Seat)
docx_classic108_comparison_matrix
- Text Similarity: 0.9753
- Visual Average: 0.9626
- Overall Score: 0.9752
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1055038 bytes, Reference=118891 bytes
Text Diff
--- minipdf/docx_classic108_comparison_matrix.pdf
+++ reference/docx_classic108_comparison_matrix.pdf
@@ -1,18 +1,30 @@
PDF Library Comparison Matrix
Feature comparison across leading PDF generation libraries.
Feature MiniPdf LibreOffice Commercial
-XLSX to PDF ✓ ✓ ✓
-DOCX to PDF ✓ ✓ ✓
-No Office ✓ ✗ ✓
+XLSX to PDF
+✓ ✓ ✓
+DOCX to PDF
+✓ ✓ ✓
+No Office
+✓ ✗ ✓
Dependency
-CJK Font Support ✓ ✓ ✓
-Chart Rendering ✓ ✓ ✓
-Cross-Platform ✓ ✓ Varies
-Open Source ✓ ✓ ✗
-NuGet Package ✓ ✗ ✓
-Batch Processing ✓ ✓ ✓
-Custom Page Size ✓ ✓ ✓
-Free for ✓ ✓ ✗
+CJK Font Support
+✓ ✓ ✓
+Chart Rendering
+✓ ✓ ✓
+Cross-Platform Varies
+✓ ✓
+Open Source
+✓ ✓ ✗
+NuGet Package
+✓ ✗ ✓
+Batch Processing
+✓ ✓ ✓
+Custom Page Size
+✓ ✓ ✓
+Free for
+✓ ✓ ✗
Commercial
-Active Benchmark ✓ N/A Varies
+Active Benchmark N/A Varies
+✓
Suite
docx_classic109_release_notes
- Text Similarity: 1.0
- Visual Average: 0.9695
- Overall Score: 0.9878
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1129946 bytes, Reference=161252 bytes
Text content: ✅ Identical
docx_classic110_troubleshooting_guide
- Text Similarity: 1.0
- Visual Average: 0.9713
- Overall Score: 0.9885
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=896788 bytes, Reference=111746 bytes
Text content: ✅ Identical
docx_classic111_meeting_agenda
- Text Similarity: 0.9912
- Visual Average: 0.9646
- Overall Score: 0.9823
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1051205 bytes, Reference=130313 bytes
Text Diff
--- minipdf/docx_classic111_meeting_agenda.pdf
+++ reference/docx_classic111_meeting_agenda.pdf
@@ -12,8 +12,8 @@
Update
10:30 DOCX Benchmark Bob 15 min
Results Review
-10:45 Code Quality Metrics Alice 15 min
-Discussion
+10:45 Code Quality Alice 15 min
+Metrics Discussion
11:00 Upcoming Release Jane 20 min
Planning
11:20 Open Discussion & All 10 min
docx_classic112_project_status_report
- Text Similarity: 0.9798
- Visual Average: 0.9566
- Overall Score: 0.9746
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1291413 bytes, Reference=159646 bytes
Text Diff
--- minipdf/docx_classic112_project_status_report.pdf
+++ reference/docx_classic112_project_status_report.pdf
@@ -5,12 +5,16 @@
GREEN GREEN AMBER GREEN
Key Milestones
Milestone Target Date Status Notes
-DOCX Parser Feb 28 ✓ Done
+DOCX Parser Feb 28
+✓ Done
Complete
-Benchmark 90 Cases Mar 1 ✓ Done
+Benchmark 90 Mar 1
+✓ Done
+Cases
Benchmark 120 Mar 6 In Progress On track
Cases
-Quality Score >= 97% Mar 10 Pending Currently 95.8%
+Quality Score >= Mar 10 Pending Currently 95.8%
+97%
NuGet Package Mar 15 Pending Blocked on quality
Release
Risks & Issues
docx_classic113_address_labels
- Text Similarity: 1.0
- Visual Average: 0.9795
- Overall Score: 0.9918
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=891716 bytes, Reference=116233 bytes
Text content: ✅ Identical
docx_classic114_test_report
- Text Similarity: 1.0
- Visual Average: 0.9472
- Overall Score: 0.9789
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=908449 bytes, Reference=147821 bytes
Text content: ✅ Identical
docx_classic115_price_list
- Text Similarity: 0.9709
- Visual Average: 0.9118
- Overall Score: 0.9531
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=911248 bytes, Reference=123194 bytes
Text Diff
--- minipdf/docx_classic115_price_list.pdf
+++ reference/docx_classic115_price_list.pdf
@@ -4,7 +4,8 @@
Product 1-9 Seats 10-49 Seats 50+ Seats
MiniPdf Community Free Free Free
MiniPdf Standard \$29/mo \$24/mo \$19/mo
-MiniPdf Professional \$79/mo \$67/mo \$55/mo
+MiniPdf \$79/mo \$67/mo \$55/mo
+Professional
MiniPdf Enterprise \$199/mo \$169/mo \$149/mo
Support Plans
Product 1-9 Seats 10-49 Seats 50+ Seats
@@ -13,8 +14,10 @@
Dedicated Support \$200/mo \$170/mo \$140/mo
Professional Services
Product 1-9 Seats 10-49 Seats 50+ Seats
-Integration Consulting \$150/hr \$130/hr \$110/hr
-Custom Development \$200/hr \$175/hr \$150/hr
+Integration \$150/hr \$130/hr \$110/hr
+Consulting
+Custom \$200/hr \$175/hr \$150/hr
+Development
Training Workshop \$500/day \$425/day \$350/day
-Note: Volume discounts are applied automatically. Contact sales for enterprise agreements over
-100 seats.
+Note: Volume discounts are applied automatically. Contact sales for enterprise agreements
+over 100 seats.
docx_classic116_risk_assessment
- Text Similarity: 1.0
- Visual Average: 0.939
- Overall Score: 0.9756
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=914672 bytes, Reference=120292 bytes
Text content: ✅ Identical
docx_classic117_employee_handbook_excerpt
- Text Similarity: 1.0
- Visual Average: 0.9525
- Overall Score: 0.981
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1140979 bytes, Reference=179606 bytes
Text Diff
--- minipdf/docx_classic117_employee_handbook_excerpt.pdf
+++ reference/docx_classic117_employee_handbook_excerpt.pdf
@@ -2,19 +2,20 @@
MiniPdf Corporation | Revised March 2026
Chapter 5: Leave Policies
5.1 Annual Leave
-Full-time employees are entitled to 20 days of paid annual leave per calendar year. Leave accrues
-at a rate of 1.67 days per month of service.
+Full-time employees are entitled to 20 days of paid annual leave per calendar year. Leave
+accrues at a rate of 1.67 days per month of service.
Years of Service Annual Days Max Carry Over
0-2 years 20 days 5 days
3-5 years 23 days 7 days
6-10 years 25 days 10 days
10+ years 30 days 15 days
5.2 Sick Leave
-Employees are entitled to 10 days of paid sick leave per year. A medical certificate is required for
-absences of three or more consecutive days.
+Employees are entitled to 10 days of paid sick leave per year. A medical certificate is
+required for absences of three or more consecutive days.
5.3 Parental Leave
-Primary caregivers are entitled to 16 weeks of paid parental leave. Secondary caregivers receive 4
-weeks of paid leave. Leave must be taken within 12 months of the child's birth or adoption date.
+Primary caregivers are entitled to 16 weeks of paid parental leave. Secondary caregivers
+receive 4 weeks of paid leave. Leave must be taken within 12 months of the child's birth or
+adoption date.
Chapter 6: Code of Conduct
6.1 Professional Behavior
• Treat all colleagues with respect and dignity
docx_classic118_data_report_with_summary
- Text Similarity: 1.0
- Visual Average: 0.9337
- Overall Score: 0.9735
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=917782 bytes, Reference=130167 bytes
Text content: ✅ Identical
docx_classic119_multi_language_document
- Text Similarity: 0.9876
- Visual Average: 0.9692
- Overall Score: 0.9827
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=4390436 bytes, Reference=136786 bytes
Text Diff
--- minipdf/docx_classic119_multi_language_document.pdf
+++ reference/docx_classic119_multi_language_document.pdf
@@ -1,11 +1,11 @@
Multilingual Document Sample
This document demonstrates text rendering in multiple languages and scripts.
-English: The quick brown fox jumps over the lazy dog. MiniPdf converts documents with high
-fidelity.
+English: The quick brown fox jumps over the lazy dog. MiniPdf converts documents with
+high fidelity.
French: Le renard brun rapide saute par-dessus le chien paresseux.
German: Der schnelle braune Fuchs springt über den faulen Hund.
Spanish: El rápido zorro marrón salta sobre el perro perezoso.
-Japanese: こんにちは世界。MiniPdfはドキュメントを高品質で変換します。
+Japanese: こんにちは世界。 MiniPdf はドキュメントを高品質で変換します。
Special Characters & Symbols
Currency: £ € ¥ ₩ ₹ ₽
Math: ± × ÷ ≈ ≠ ≤ ≥ ∞ √ π
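The File Size lines are easier to compare as a ratio than as raw byte counts. The sketch below parses one metric line in the format used in this report and returns how many times larger the MiniPdf output is than the reference. The helper name `size_ratio` and the regular expression are illustrative assumptions, not part of the report generator.

```python
import re

# Matches lines such as:
# "- File Size: MiniPdf=4390436 bytes, Reference=136786 bytes"
_SIZE_RE = re.compile(r"File Size: MiniPdf=(\d+) bytes, Reference=(\d+) bytes")


def size_ratio(line: str) -> float:
    """Return MiniPdf size divided by reference size for one File Size line."""
    match = _SIZE_RE.search(line)
    if match is None:
        raise ValueError(f"not a File Size line: {line!r}")
    minipdf_bytes, reference_bytes = (int(group) for group in match.groups())
    if reference_bytes == 0:
        raise ValueError("reference size is zero")
    return minipdf_bytes / reference_bytes


if __name__ == "__main__":
    line = "- File Size: MiniPdf=4390436 bytes, Reference=136786 bytes"
    print(f"{size_ratio(line):.1f}x")
```

For the multi-language case above this gives roughly a 32-fold size difference.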
docx_classic120_comprehensive_business_proposal
- Text Similarity: 0.9918
- Visual Average: 0.9623
- Overall Score: 0.9816
- Pages: MiniPdf=3, Reference=3
- File Size: MiniPdf=1155542 bytes, Reference=200095 bytes
Text Diff
--- minipdf/docx_classic120_comprehensive_business_proposal.pdf
+++ reference/docx_classic120_comprehensive_business_proposal.pdf
@@ -16,13 +16,13 @@
8. Terms & Conditions..................................9
---PAGE---
1. Executive Summary
-MiniPdf Corporation proposes an enterprise document processing solution that eliminates the
-dependency on Microsoft Office for PDF generation. Our solution provides high-fidelity conversion
-of XLSX and DOCX files to PDF format with an average quality score of 97%.
+MiniPdf Corporation proposes an enterprise document processing solution that eliminates
+the dependency on Microsoft Office for PDF generation. Our solution provides high-fidelity
+conversion of XLSX and DOCX files to PDF format with an average quality score of 97%.
Key Benefits
Cost Reduction Performance Quality
-60% lower licensing costs 3x faster conversion with no 97%+ fidelity score verified by
-compared to traditional external dependencies automated benchmarks
+60% lower licensing costs 3x faster conversion with 97%+ fidelity score verified
+compared to traditional no external dependencies by automated benchmarks
solutions
2. Problem Statement
• Heavy dependency on Microsoft Office installations for PDF generation
@@ -30,8 +30,8 @@
• High licensing costs for enterprise-scale document processing
• Limited automation capabilities with traditional office suites
3. Proposed Solution
-MiniPdf is a lightweight .NET library that provides native document-to-PDF conversion without
-requiring Microsoft Office installation. Key features include:
+MiniPdf is a lightweight .NET library that provides native document-to-PDF conversion
+without requiring Microsoft Office installation. Key features include:
1. XLSX to PDF with chart and formula support
2. DOCX to PDF with full formatting preservation
3. CJK font auto-embedding for international documents
docx_classic121_thin_border_table
- Text Similarity: 1.0
- Visual Average: 0.9867
- Overall Score: 0.9947
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=888418 bytes, Reference=93903 bytes
Text content: ✅ Identical
docx_classic122_thick_outer_border_table
- Text Similarity: 1.0
- Visual Average: 0.9849
- Overall Score: 0.994
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=890019 bytes, Reference=98269 bytes
Text content: ✅ Identical
docx_classic123_dashed_border_table
- Text Similarity: 1.0
- Visual Average: 0.9922
- Overall Score: 0.9969
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=881006 bytes, Reference=94091 bytes
Text content: ✅ Identical
docx_classic124_colored_border_table
- Text Similarity: 1.0
- Visual Average: 0.9847
- Overall Score: 0.9939
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=884775 bytes, Reference=88322 bytes
Text content: ✅ Identical
docx_classic125_solid_cell_fills
- Text Similarity: 1.0
- Visual Average: 0.9725
- Overall Score: 0.989
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=885849 bytes, Reference=92426 bytes
Text content: ✅ Identical
docx_classic126_dark_header_table
- Text Similarity: 1.0
- Visual Average: 0.9803
- Overall Score: 0.9921
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=889943 bytes, Reference=101627 bytes
Text content: ✅ Identical
docx_classic127_font_styles_showcase
- Text Similarity: 1.0
- Visual Average: 0.9819
- Overall Score: 0.9928
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1336492 bytes, Reference=138431 bytes
Text content: ✅ Identical
docx_classic128_font_sizes_showcase
- Text Similarity: 1.0
- Visual Average: 0.9733
- Overall Score: 0.9893
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=881877 bytes, Reference=79780 bytes
Text Diff
--- minipdf/docx_classic128_font_sizes_showcase.pdf
+++ reference/docx_classic128_font_sizes_showcase.pdf
@@ -6,9 +6,9 @@
Font size 12pt — The quick brown fox jumps over the lazy dog.
Font size 14pt — The quick brown fox jumps over the lazy dog.
Font size 16pt — The quick brown fox jumps over the lazy dog.
-Font size 18pt — The quick brown fox jumps over the lazy
-dog.
-Font size 20pt — The quick brown fox jumps over the
+Font size 18pt — The quick brown fox jumps over the
lazy dog.
-Font size 24pt — The quick brown fox jumps
-over the lazy dog.
+Font size 20pt — The quick brown fox jumps over
+the lazy dog.
+Font size 24pt — The quick brown fox
+jumps over the lazy dog.
docx_classic129_alignment_combinations
- Text Similarity: 1.0
- Visual Average: 0.9882
- Overall Score: 0.9953
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883884 bytes, Reference=93307 bytes
Text content: ✅ Identical
docx_classic130_wrap_and_indent
- Text Similarity: 1.0
- Visual Average: 0.9824
- Overall Score: 0.993
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=886605 bytes, Reference=80252 bytes
Text Diff
--- minipdf/docx_classic130_wrap_and_indent.pdf
+++ reference/docx_classic130_wrap_and_indent.pdf
@@ -1,10 +1,10 @@
Text Wrapping and Indentation
Long Wrapping Text
-This is a long text that should wrap within the page margins when rendered to PDF. It tests the
-ability of the converter to handle text that flows across multiple lines without explicit line breaks.
-The paragraph continues with additional sentences to ensure sufficient length for wrapping.
-Modern document processing must handle variable-width fonts, kerning, and proper hyphenation
-to produce high-quality output.
+This is a long text that should wrap within the page margins when rendered to PDF. It tests
+the ability of the converter to handle text that flows across multiple lines without explicit
+line breaks. The paragraph continues with additional sentences to ensure sufficient length
+for wrapping. Modern document processing must handle variable-width fonts, kerning, and
+proper hyphenation to produce high-quality output.
Indented Paragraphs
Indent level 0
Indent level 1
docx_classic131_number_format_table
- Text Similarity: 1.0
- Visual Average: 0.9797
- Overall Score: 0.9919
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=893067 bytes, Reference=98702 bytes
Text content: ✅ Identical
docx_classic132_striped_table
- Text Similarity: 1.0
- Visual Average: 0.9339
- Overall Score: 0.9736
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=897965 bytes, Reference=103001 bytes
Text content: ✅ Identical
docx_classic133_gradient_rows_table
- Text Similarity: 1.0
- Visual Average: 0.9625
- Overall Score: 0.985
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=894739 bytes, Reference=96779 bytes
Text content: ✅ Identical
docx_classic134_heatmap_table
- Text Similarity: 1.0
- Visual Average: 0.9646
- Overall Score: 0.9858
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=898299 bytes, Reference=92444 bytes
Text content: ✅ Identical
docx_classic135_bottom_border_paragraphs
- Text Similarity: 1.0
- Visual Average: 0.9803
- Overall Score: 0.9921
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=880664 bytes, Reference=87812 bytes
Text content: ✅ Identical
docx_classic136_financial_statement
- Text Similarity: 1.0
- Visual Average: 0.9675
- Overall Score: 0.987
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=896402 bytes, Reference=114465 bytes
Text content: ✅ Identical
docx_classic137_checkerboard_table
- Text Similarity: 1.0
- Visual Average: 0.9571
- Overall Score: 0.9828
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=898314 bytes, Reference=63596 bytes
Text content: ✅ Identical
docx_classic138_color_grid_table
- Text Similarity: 1.0
- Visual Average: 0.9813
- Overall Score: 0.9925
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=877652 bytes, Reference=65776 bytes
Text content: ✅ Identical
docx_classic139_paragraph_shading_patterns
- Text Similarity: 1.0
- Visual Average: 0.8805
- Overall Score: 0.9522
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=882539 bytes, Reference=72758 bytes
Text content: ✅ Identical
docx_classic140_rotated_text_table
- Text Similarity: 0.8158
- Visual Average: 0.9742
- Overall Score: 0.916
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=883321 bytes, Reference=90797 bytes
Text Diff
--- minipdf/docx_classic140_rotated_text_table.pdf
+++ reference/docx_classic140_rotated_text_table.pdf
@@ -1,5 +1,7 @@
Rotated Text in Table
Note: Text rotation in table cells uses vertical text direction via the textDirection element.
Normal Bottom-to-Top Top-to-Bottom TB-LR-V TB-RL
-Rotated text Rotated text Rotated text Rotated text Rotated text
-sample sample sample sample sample
+Rotated text sample Rotated text sample Rotated text sample
+Rotated text
+sample
+Rotated text sample
docx_classic141_mixed_border_styles
- Text Similarity: 1.0
- Visual Average: 0.9903
- Overall Score: 0.9961
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=881939 bytes, Reference=90728 bytes
Text content: ✅ Identical
docx_classic142_styled_invoice_document
- Text Similarity: 1.0
- Visual Average: 0.9306
- Overall Score: 0.9722
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=275985 bytes, Reference=94931 bytes
Text content: ✅ Identical
docx_classic143_multi_section_document
- Text Similarity: 1.0
- Visual Average: 0.8782
- Overall Score: 0.9513
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=270950 bytes, Reference=80334 bytes
Text content: ✅ Identical
docx_classic144_note_style_paragraphs
- Text Similarity: 1.0
- Visual Average: 0.986
- Overall Score: 0.9944
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=1110236 bytes, Reference=110519 bytes
Text content: ✅ Identical
docx_classic145_status_badge_table
- Text Similarity: 1.0
- Visual Average: 0.9703
- Overall Score: 0.9881
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=895150 bytes, Reference=110142 bytes
Text content: ✅ Identical
docx_classic146_double_border_table
- Text Similarity: 1.0
- Visual Average: 0.9805
- Overall Score: 0.9922
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=887202 bytes, Reference=96549 bytes
Text content: ✅ Identical
docx_classic147_multi_section_styled_report
- Text Similarity: 1.0
- Visual Average: 0.9299
- Overall Score: 0.972
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=906537 bytes, Reference=115750 bytes
Text content: ✅ Identical
docx_classic148_data_grid_document
- Text Similarity: 1.0
- Visual Average: 0.894
- Overall Score: 0.9576
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=930182 bytes, Reference=111668 bytes
Text content: ✅ Identical
docx_classic149_merged_section_report
- Text Similarity: 1.0
- Visual Average: 0.9226
- Overall Score: 0.969
- Pages: MiniPdf=1, Reference=1
- File Size: MiniPdf=274876 bytes, Reference=84151 bytes
Text content: ✅ Identical
docx_classic150_kitchen_sink_styles
- Text Similarity: 1.0
- Visual Average: 0.9559
- Overall Score: 0.9824
- Pages: MiniPdf=2, Reference=2
- File Size: MiniPdf=1363725 bytes, Reference=159873 bytes
Text Diff
--- minipdf/docx_classic150_kitchen_sink_styles.pdf
+++ reference/docx_classic150_kitchen_sink_styles.pdf
@@ -9,9 +9,9 @@
Font rendering Pass 98% Excellent fidelity
Table borders Pass 95% Minor thick-border
gap
-Cell shading Pass 97% All fill types supported
-Image embedding Pass 96% PNG and JPEG
-tested
+Cell shading Pass 97% All fill types
+supported
+Image embedding Pass 96% PNG and JPEG tested
Page layout Pass 94% Landscape needs
work
Double border below this paragraph
Improvement Suggestions
⚠ Low-Score Test Cases (below 0.8)
- docx_classic34_employee_directory_with_photo (score: 0.6957)
- docx_classic50_long_table_with_formatting (score: 0.7522)
- docx_classic13_long_document (score: 0.7749)
Review the text diffs and visual comparisons above to identify specific rendering issues.
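The Overall Score values in this report are consistent with a weighted sum of 0.4 × Text Similarity + 0.4 × Visual Average + 0.2 × a page-count term that equals 1 when the page counts match. That weighting is inferred from the numbers, not stated by the report, so the sketch below should be read as an assumption. The names `overall_score` and `low_score_cases` are hypothetical, and the page term for mismatched page counts is left as a parameter because the report does not define it.

```python
def overall_score(text_sim: float, visual_avg: float, page_score: float = 1.0) -> float:
    """Weighted score under the inferred 0.4 / 0.4 / 0.2 weights (an assumption)."""
    return round(0.4 * text_sim + 0.4 * visual_avg + 0.2 * page_score, 4)


def low_score_cases(scores: dict[str, float], threshold: float = 0.8) -> list[tuple[str, float]]:
    """Return (name, score) pairs strictly below the threshold, lowest first."""
    flagged = [(name, score) for name, score in scores.items() if score < threshold]
    return sorted(flagged, key=lambda item: item[1])


if __name__ == "__main__":
    # Metric values copied from entries in this report.
    print(overall_score(0.8158, 0.9742))  # docx_classic140_rotated_text_table
    scores = {
        "docx_classic13_long_document": 0.7749,
        "docx_classic140_rotated_text_table": 0.916,
        "docx_classic139_paragraph_shading_patterns": 0.9522,
    }
    for name, score in low_score_cases(scores):
        print(f"- {name} (score: {score})")
```

Under these weights, a case with identical text and matching page counts can only fall below 0.8 if its Visual Average drops under 0.5, so low overall scores usually point to a text or page-count mismatch as well.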