Aspose.Words for Python via .NET 22.8 Release Notes

Major Features

There are 143 improvements and fixes in this regular monthly release. The most notable are:

  • Published API Reference
  • AZW3 (also called KF8) file format is now supported for import and export.
  • Implemented support for cell spacing in tables upon rendering.
  • Improved font substitution for symbols.
  • Improved the rotation algorithm of axis labels of the charts.

Full List of Issues Covering all Changes in this Release (Reported by .NET Users)

Key Summary Category
WORDSNET-3895 Support export to Kindle and Mobipocket formats New Feature
WORDSNET-7013 Support  “Use Word 2003 indent rules for text next to wrapped objects” compatibility option New Feature
WORDSNET-11557 ‘Allow Spacing between cells’ option of a Table is not preserved during rendring New Feature
WORDSNET-14915 Table width and alignment issues New Feature
WORDSNET-23555 Provide Aspose.Words for Python 3.10 New Feature
WORDSNET-23839 Export Phonetic Guide content to TXT New Feature
WORDSNET-24104 Inaccurate font fallback for chess pawn symbol when saving to PDF New Feature
WORDSNET-5256 Part of content is moved to the previous page during rendering Enhancement
WORDSNET-9749 Document.UpdateFields does not update the TOC field page numbers Enhancement
WORDSNET-11815 VML objects not supported during Html to Word conversion Enhancement
WORDSNET-12694 NumPages field returns incorrect value Enhancement
WORDSNET-20239 LayoutCollector incorrectly returns page index Enhancement
WORDSNET-22594 508 /WCAG Compliance word document Enhancement
WORDSNET-23999 Send warning about accessibility issues when saving to PDF/A and PDF/UA Enhancement
WORDSNET-24015 Fix critical SonarQube issues Enhancement
WORDSNET-24046 Add an option to specify path where Aspose.Words should look for harfbuzz.dll Enhancement
WORDSNET-1043 Table layout is incorrect Bug
WORDSNET-3626 Border style of a table is missed during rendering Bug
WORDSNET-3692 Table width is incorrect during rendering Bug
WORDSNET-4106 Custom table borders are changed during rendering Bug
WORDSNET-4794 Support cell spacing in layout Bug
WORDSNET-5143 Spacing between lines is bigger after rendering Bug
WORDSNET-5606 Borders around table are lost during rendering Bug
WORDSNET-5926 Cellspacing not working for PDF while using similar code snippet Bug
WORDSNET-7014 Extra page is incorrectly printed at the end of first Page Bug
WORDSNET-7203 Cell spacing between outer thick border and Table Cells is not preserved in PDF Bug
WORDSNET-7204 AutoShape is not aligned with the Table in PDF Bug
WORDSNET-7263 Contents moved to previous page in output PDF file Bug
WORDSNET-8101 Spacing between Cells is not respected in fixed page formats Bug
WORDSNET-8304 Cell spacing/Rendered Table column widths differ in PDF Bug
WORDSNET-8442 Spacing between Cells is not respected in fixed page formats Bug
WORDSNET-9368 Page wraps at wrong place due to cell spacing in the table Bug
WORDSNET-9936 Width of table’s cells is changed after conversion from DOCX to PDF Bug
WORDSNET-10849 Contents position is changed after conversion from DOC to PDF Bug
WORDSNET-10958 Table’s cell wrapping issue while converting HTML to PDF Bug
WORDSNET-11192 DOCX to PDF conversion issue with table’s cell spacing Bug
WORDSNET-11372 Text does not render correctly in output PDF Bug
WORDSNET-11932 DOCX to PDF conversion issue with table rendering Bug
WORDSNET-12053 Cell spacing is lost after conversion from DOC to PDF Bug
WORDSNET-12070 Spacing is lost after conversion from DOC to HTML/MHTML/SVG/ePub Bug
WORDSNET-12601 Tables have incorrect widths in PDF Bug
WORDSNET-12693 Document.PageCount returns incorrect value Bug
WORDSNET-12703 Table size reduced in PDF after Word to PDF conversion Bug
WORDSNET-12818 Content moves to previous pages in PDF Bug
WORDSNET-13120 Vertical spacing of empty rows is reduced in PDF Bug
WORDSNET-13317 Spacing between table cells not always taken into account correctly during rendering Bug
WORDSNET-13892 Table borders lost during rendering to PDF Bug
WORDSNET-13893 Table cell spacing/wrapping incorrect during rendering to PDF Bug
WORDSNET-13937 Paragraph is rendered on previous page in output PDF Bug
WORDSNET-13987 A table row moves to the previous Page in PDF Bug
WORDSNET-14188 Cell borders are missing for table in PDF Bug
WORDSNET-14446 ‘border-spacing’ value is ignored during conversion from HTML to PDF Bug
WORDSNET-15071 Allow spacing between cells is lost after DOTX to PDF conversion Bug
WORDSNET-16089 Table’s cell is disappeared after calling UpdateTableLayout in PDF Bug
WORDSNET-16170 The default table cell spacing ignored Bug
WORDSNET-16414 Table position and height is changed after conversion from DOCX to PDF Bug
WORDSNET-16447 A Paragraph renders at the end of previous page in PDF Bug
WORDSNET-16562 Table incorrect after convert to PDF Bug
WORDSNET-16740 Incorrect Table Layout in PDF Bug
WORDSNET-17498 Table cell content being wrapped up on generating PDF Bug
WORDSNET-17665 The rendered output is missing one page Bug
WORDSNET-17807 DOCX to PDF conversion issue with images position Bug
WORDSNET-17808 DOCX to PDF - table not same Bug
WORDSNET-18523 Vertical spacing of some elements differ in PDF Bug
WORDSNET-18532 Table borders missing when saving to PDF Bug
WORDSNET-18539 Document.PageCount returns incorrect value Bug
WORDSNET-18594 A paragraph moves to the end of previous page in PDF Bug
WORDSNET-18601 DOCX to PDF conversion issue with cell’s spacing Bug
WORDSNET-18960 PDF output different than MS Word PDF output Bug
WORDSNET-19119 Text is shifted up to previous page in output PDF Bug
WORDSNET-19415 List item’s indent is incorrect after DOCX-HTML-DOCX roundtrip Bug
WORDSNET-19736 Incorrect scaling of the vertical numeric axis when converting to PDF Bug
WORDSNET-19737 Incorrect minimum of the vertical numeric axis when converting to PDF Bug
WORDSNET-19738 Axis labels text is rendered in two lines when converting to PDF Bug
WORDSNET-19953 Table width is changed after DOCX to PDF conversion Bug
WORDSNET-20161 Page numbers in TOC are not updating correctly Bug
WORDSNET-21063 Preserve 5px outset red border around DIV during HTML to Word Conversion Bug
WORDSNET-21311 DOCX to PDF conversion layout issue Bug
WORDSNET-21490 Table Conversion Issue on docx to PDF Bug
WORDSNET-21834 Table is not rendered properly when exported to PDF Bug
WORDSNET-22199 Spacing between table cells are lost after DOCX to PDF conversion Bug
WORDSNET-22430 A Paragraph moves to previous page in PDF Bug
WORDSNET-22695 The file is empty after load and save with words sdk Bug
WORDSNET-22737 Image is lost after DOCX to PDF conversion Bug
WORDSNET-22774 Table border issues when converting DOCX Bug
WORDSNET-22792 Paragraphs in wrong format in saved document Bug
WORDSNET-22866 UWP application does not compile in release mode Bug
WORDSNET-22904 Chinese document layout differs from MS Word Bug
WORDSNET-22912 The table border displayed not correctly after converting from DOCX to PDF Bug
WORDSNET-23183 H3 tag doesnt support CSS classes when converted from HTML to Word Bug
WORDSNET-23247 X-Axis labels are improperly rotated Bug
WORDSNET-23479 Content overflows on next page after adding watermark Bug
WORDSNET-23494 Order of paragraphs is changed after open save WML document Bug
WORDSNET-23619 Intervals between words are too big in some cells in RTL table Bug
WORDSNET-23620 Last symbols from first column are moved to the second column in RTL table Bug
WORDSNET-23664 Text in category labels is wrapped improperly Bug
WORDSNET-23666 Spacing between cells is lost upon rendering Bug
WORDSNET-23680 REF field is lost after updating fields and saving document to Fixed Page formats Bug
WORDSNET-23765 Checkboxes are imported improperly from ODT Bug
WORDSNET-23805 FileCorruptedException is thrown upon loading MHTML document Bug
WORDSNET-23814 Text shaper factory failed to return text shaper exception on Linux Bug
WORDSNET-23815 Additional revision in the output document Bug
WORDSNET-23881 InvalidOperationException is thrown upon updating fields Bug
WORDSNET-23891 PdfRenderer should not reference internal libraries in release builds Bug
WORDSNET-23901 StackOverflowException on MHTML loading Bug
WORDSNET-23917 InvalidOperationException at FootnoteCorrector.MergeLines Bug
WORDSNET-23935 MS Word fails to load a DOCX document generated by Aspose.Words Bug
WORDSNET-23949 Table layout is distorted after rendering Bug
WORDSNET-24018 Revisions reports no groups while there is a gorup Bug
WORDSNET-24026 VML WordArt’s width is incorrect Bug
WORDSNET-24028 FileCorruptedException is thrown upon loading RTF document Bug
WORDSNET-24031 Incorrect RTL table and table content positioning Bug
WORDSNET-24032 Part of content is missed and text wrapping is changed after importing PDF Bug
WORDSNET-24042 PDF with tables is imported improperly Bug
WORDSNET-24043 RTL content is imported improperly from PDF Bug
WORDSNET-24044 EQ field is exported incorrectly to TXT Bug
WORDSNET-24049 Cross references are not working after CXP updates Bug
WORDSNET-24051 Hyphenated text is wrapped improperly Bug
WORDSNET-24053 Style Separator is not inserted if add an empty section before building the document Bug
WORDSNET-24064 FileCorruptedException is thrown upon loading ODT document Bug
WORDSNET-24066 Bullet symbols are missing when saving to PDF with UseCoreFonts option Bug
WORDSNET-24086 Bullet symbols are missing when saving to PDF with UseCoreFonts option Bug
WORDSNET-24087 Bookmark itself is removed after setting it’s text to empty string Bug
WORDSNET-24098 The document appears to be corrupted - exception when trying to open DOCX Bug
WORDSNET-24099 Consider providing an ability to get font of the current LayoutEnumerator entity Bug
WORDSNET-24101 DOCX to RTF conversion error Bug
WORDSNET-24128 InvalidOperationException on DOCX roundtrip Bug

Full List of Issues Covering all Changes in this Release (Reported by Java Users)

Key Summary Category
WORDSNET-15723 Table border is lost after conversion from DOCX to PDF Bug
WORDSNET-16850 DOC to PDF Number of Pages Decreased Bug
WORDSNET-17448 Table layout changes during PDF conversion for a Hebrew file Bug
WORDSNET-19219 TOC page number issue after Section having Landscape orientation and Section break Bug
WORDSNET-21008 PDF is losing the table cell style after DOCX to PDF conversion Bug
WORDSNET-21009 Incorrect rendering of Table spanning across multiple TextColumns Bug
WORDSNET-21050 Spacing between Table Cells not preserved during DOCX to PDF conversion Bug
WORDSNET-23360 DOCX to PDF: Table rendered incorrectly Bug
WORDSNET-23630 Part of content is moved to previous page Bug
WORDSNET-23705 Row is moved to previous page upon rendering Bug
WORDSNET-23860 NullReferenceException is thrown upon rendering document Bug
WORDSNET-23861 Position of endnotes is incorrect after rendering Bug
WORDSNET-23975 Font size is changed after rendering document Bug
WORDSNET-24052 Line moved to the next page Bug
WORDSNET-24088 InvalidOperationException when converting MHTML to DOC Bug
WORDSNET-24106 WordML to DOCX: Missing content Bug

Public API and Backward Incompatible Changes

This section lists public API changes that were introduced in Aspose.Words 22.8. It includes not only new and obsoleted public methods, but also a description of any changes in the behavior behind the scenes in Aspose.Words which may affect existing code. Any behavior introduced that could be seen as regression and modifies the existing behavior is especially important and is documented here.

Added support for AZW3 export

Related issue: WORDSNET-3895

Aspose.Words now can export documents to AZW3 file format.

AZW3 (also called KF8) is Amazon Kindle’s proprietary e-book file format. AZW3 is essentially a compiled EPUB.

The following publicly visible enum value were added:

The use cases for saving a document to AZW3 file format are as follows:

doc = aw.Document("in.docx")
doc.save("out.azw3")
or
doc = aw.Document("in.docx")
options = aw.saving.HtmlSaveOptions(aw.SaveFormat.AZW3)
doc.save("out.azw3", options)

Added warnings for accessibility issues when saving to PDF/A and PDF/UA

Related issue: WORDSNET-23999

PDF/A and PDF/UA format imposes several accessibility requirements related to the document content that cannot be fulfilled during automatic conversion from a document in Word format to PDF. See documentation article for more info. Now warnings are issued for some of these problems. The warnings are issued when saving to one of the PDF/A or PDF/UA formats and the issue violates the compliance. E.g. the warning about missing document title will be issued when saving to PDF/UA and will not be issued when saving to PDF/A.

All warnings are of WarningType.MINOR_FORMATTING_LOSS and WarningSource.PDF. Here is the list of new warnings Description values:

  • “Document title is missing. This violates the compliance requirements. The output document will not be fully compliant.”
  • “The document contains headings which levels are not consecutive. This violates the compliance requirements. The output document will not be fully compliant.”
  • “There are shapes without alt text in the document. This violates the compliance requirements. The output document will not be fully compliant.”
  • “There are tables without alt text in the document. This violates the compliance requirements. The output document will not be fully compliant.”
  • “There are hyperlinks without alt text in the document. This violates the compliance requirements. The output document will not be fully compliant.”
  • “There are tables without header row/column in the document. This violates the compliance requirements. The output document will not be fully compliant.”
  • “The document contains Unicode PUA characters. This violates the compliance requirements. The output document will not be fully compliant.”
  • “The document contains .notdef glyphs. This violates the compliance requirements. The output document will not be fully compliant.”

Removed obsolete property HtmlSaveOptions.export_text_box_as_svg

Related issue: WORDSNET-23734

The obsolete property HtmlSaveOptions.export_text_box_as_svg has been removed. Please use the HtmlSaveOptions.export_shapes_as_svg property instead.